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Abstract 

This research addresses the problem of proving lower bounds on the complexity of alge- 
braic computations involving the floor operation. The model of computation considered 
is a computation tree with the set of basic operations { + , — , *, /, [-J , >}. The constants 
available to the computation are and 1, and every other constant needs to be generated 
explicitly. The problems that are considered may be broken down into the following 
categories: 

1. Functions of a single input. A general lower bound technique is developed for a class 
of functions that have as their input a single n-bit integer. A characterization of the 
functions to which this technique applies is given. The characterization is general, 
and applies to many natural functions, such as perfect square root (deciding if the 
square root of the input is integral or not) and computing the value of [log log x\ . 
Every function that with this characterization requires at least Q(\/logn) opera- 
tions. 

Upper bounds are given which use the floor function in non-trivial ways. The most 
surprising result is that the computation of 2 2 can be accelerated in the presence 
of the floor function. Using only rational operations (i.e. {+,—,*,/}), it requires 
O(k) operations to compute (by repeated squaring), and this bound is tight (for 
ratioual operations). In contrast, we show that with the use of the floor operation 
and an additional input greater than 2 2 , the number 2 2 can be generated in 
0(vA;) operations. Using the upper bounds developed, an 0(y/\ogn) upper bound 
for computing [loglogzj is given (where x is an n-bit integer). This upper bound 
matches the lower bound proved for this function. 



2. Functions of two inputs. A very natural example of a function that has two integer 
inputs is the Greatest Common Divisor function. Grotschel, Lovasz and Schrijver 
posed as an open problem the problem of showing that computing the greatest 
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Chapter 1 



Introduction 



It is very natural to define the complexity of an algorithm with respect to the number of 
operations that it performs. Computation trees give a way to formalize such a model. A 
computation tree, intuitively, has a set of basic operations, each requires unit time, and 
the complexity of the algorithm is measured with respect to this set of operations. (See 
Chapter 2 for both motivation and definition of the computation tree model.) 

Arithmetic operations, e.g., addition, multiplication, etc, are natural candidates for 
the set of allowable unit time operations. Deriving lower bounds for computations that 
involve arithmetic operations has received much attention. As one would expect, at first 
lower bounds were developed for limited sets of arithmetic operations, and with time, 
the results were extended to richer sets of operations. Probably the simplest model is 
one that has only comparisons, which is used to show an fi(nlogn) lower bound for 
sorting (see [KnuSl]). A considerably richer model is one that allows linear operations 
in addition to comparisons. Lower bounds for computations with linear operations can 
be found in [Rei71, Dob76, DL78, DLSO, Yao75, YR80]. As for rational operations, i.e. 
{+,—,*,/}, the first lower bound is [Rab72], which is an information theoretic lower 
bound. 
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inputs. If the size of numbers is not restricted then "too many" functions are computed 
with a polynomial number of operations. For example, any PSPACE-computation can 
be simulated in polynomial time [BMS81, PS76, Sim81]. Moreover, it is known that hard 
problems, e.g., factoring, can be solved in linear time [Sha79]. Restricting the size of 
the intermediate results to be polynomial in the size of the inputs ensures that strongly 
polynomial time is contained in polynomial time. 

Lower bounds techniques for computations that use the floor operation have been 
much less successful than for rational operations. Perhaps one reason for the lack of 
progress in this area is that this set of operations does not possess the "nice" algebraic 
properties that rational functions possess. 

Previously there have been a few lower bound techniques developed to handle the 
floor operation. In [BJM88], a computation tree that operates on real numbers and has 
the floor operation in its repertoire was studied. They show, using topological arguments, 
that there are certain classes of languages which can not be decided by analytic compu- 
tation trees. [JMW89], compares the expressive power of computation trees with various 
sets of operations, and proves lower bounds for computation trees with the operations 
{ + ,—,DIV c }, where DIV C denotes integer division by constants. Based on proof tech- 
niques of [JMW89], [Bas90a] shows an fi(n/logn) lower bound for computing the GCD, 
using { + ,— , |_-J > Xc,/c}, where x c and / c are multiplication and division by constants. 
(Note that in such a model the mod operation cannot be performed in 0(1) operations.) 
The work of [IMR83] discusses the relation of the floor operation and indirect address- 
ing. In [D085, Jia79], an fi(nlogn) lower bound for sorting rational numbers, using a 
restricted floor operation, is given. 

The aim of our research is to establish lower bounds for computations that use the 
floor operation, in addition to rational operations. The main motivation for considering 
the floor operation, is that it is simple to implement in practice, and that it exists as a 
basic operation in most computers and programming languages. 
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the average error is less than e, requires at least fi(Wioglog ^) operations. (In the lower 
bound we assume that the computation receives only the input x 6 [1, 2], as in the case of 
Newton iterations, and tries to produce the best approximation it can, given a bound on 
the number of operations.) In a somewhat different setting, we show how to approximate 
the s t root, for any x G [1,2]. The approximated value, for any input x £ [1,2], is at 
most e away from y/x, and the number of operations is 0(y\og\og ^). (In the upper 
bound, the computation receives both the input x £ [1,2] and the accuracy parameter 
*■) 

All our lower bounds assume that we have to generate all the constants that are used 
in the computation explicitly. We show that without this restriction any polynomial 
could be computed in 0(1) operations (independent of its degree). In order to avoid any 
possible confusion, we quantify our lower bounds by stating that the computation has 
initially only the constants {0, 1}. 

The research in this work was done in collaboration with Baruch Schieber and Prasoon 
Tiwari, and preliminary versions of it appear in [MSTS8, MST89b, MST89a]. 

This thesis is organized as follows. In Chapter 2 we define the computation model 
and prove a few results concerning polynomials and rational functions. In Chapter 3 
we develop the proof technique for one variable functions. In Chapter 4 we extend the 
technique from one variable to two variables, and show the lower bound for computing 
the greatest common divisor of two integers. In Chapter 5 we show the lower bounds for 
approximation. In Chapter 6 we show the upper bounds. In Chapter 7 we summarize 
the results and suggest directions for future research. 



Chapter 2 



Preliminaries 



2.1 Computation Tree Model 

In this section, we define the computation tree model. Let us motivate the definition 
chosen. Consider a specific computer, it has some set of instructions that are built into 
it. A program that runs on that computer uses this set of operations, usually referred to 
as "machine instructions". A natural measure of the time complexity of a program is the 
number of machine instructions that it performs. Although this approach assumes that 
each machine instruction requires the same amount of time, in many cases this gives a 
good approximation. 

This means that it is natural to fix some set of operations, and consider them as basic 
operations. The time complexity of computing a certain function is the number of basic 
operations performed. 

We would like a formal model that would make it "easy" to define time complexity. 
If the set of basic operations does not include a branching operation, then the number of 
operations performed, on any input, is simply the number of operations in the program. 

15 



2.1. COMPUTATION TREE MODEL 17 

Definition 1 A computation tree T consists of a labeled binary tree, and a set of 
operations OP. Each vertex of the tree is labeled in one of the following ways. 

• Input vertices: An input vertex is labeled by a certain input, e.g. x\. Each input 
vertex has at most one child. 

• Computation vertices: Each computation vertex u is labeled with an operation f u = 
0(1?!, . . . ,Vk), where o G OP, and u; is either a computation or an input vertex that 
appears on the path from the root to node u. Each computation vertex has at most 
one child. 

• Comparison vertices: Each comparison vertex u is labeled with v\ < v 2 , where, 
again, y t - is either a computation or an input vertex that appears on the path from 
the root to node u. Each comparison vertex has two children. 

Given an input the computation tree defines a computation. The computation tra- 
verses a path from the root to a leaf and assigns values to the input and computation 
vertices on the path in the following way. 

The computation starts at the root of the tree T, and proceeds as follows: 

1. When the computation arrives at an input vertex u, labeled by a:,-, it assigns u the 
value of the input x,, and continues to the child of u. 

2. When the computation arrives at a computation vertex u, labeled by /„ = op(v-y, . . . , 
Vk), it assigns u the value val(u), which is op(val(vi), . . . , val(vk)), where val(vi) is 
the value of vertex v,. Note, that since u,- appears on the path from the root to u, 
the computation already assigned some value to i>,-. The computation continues to 
the child of u. 

3. When the computation arrives at a comparison vertex u, labeled with t>i < v 2 , it 
proceeds to the left child of u, if val(vi) < val(v 2 ), and to the right child of u, 
otherwise. (A comparison node does not have a value.) 
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2.2 Polynomials and rational expressions 

This section includes some basic definitions and results about polynomial and rational 
expressions. These results will be used extensively in the following chapters. 

The degree of a polynomial P(x\, . . . , Xk) with respect to a variable x,, denoted 
deg x .(P), is the maximum exponent of #,- appearing in any monomial of P{x\, . . . ,Xk). 
The degree of P, denoted deg(P), is max,- deg x> (P). 

The size of the coefficients plays a crucial role in the lower bounds that we prove 
later. For this reason we add another parameter, which is the maximum coefficient of P, 
denoted max-coef[P). The value of max-coef(P) is defined to be the maximum among 
the absolute values of the coefficients of P. 

We extend the notion of degree and maximum coefficient of a polynomial to a set of 
polynomials, in a rather straightforward way. For a set A of polynomials, the degree of 
A, denoted deg(A), is maxp € A deg(P). Similarly, the maximum coefficient of A, denoted 
max-coej{A), is maxpgA max-coef(P). 

The following lemma relates the degree and maximum coefficient of polynomials and 
the operations {+,—,*}. 

Lemma 2.1 Let P(xi, . . . , x^) and Q(xi, . . . , Xk) be two multivariate polynomials. 
Then, 

1. deg(P ±Q)< max{<%(P), deg(Q)}, 

2. max-coef{P ± Q) < max-coef(P) + max-coej{Q), 

3. deg{P *Q)< deg(P) + deg(Q), and 

4. max-coef(P * Q) < (1 + min{deg{P), deg{Q)}) k max-coef(P)max-coef{Q). 
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A lexicographic order on polynomials 

We define a lexicographic order on the set of multivariate polynomials. For this purpose, 
we use the following lexicographic order on the set of multivariate monomials. 

Definition 2 For two monomials cx[ l . . . x' k k and dx{* . . . x k k , the relation cx'^ . . . x\ k 
>~ dx[ l . . . x k k holds if either (1) there exists an m, such that i m > j m , and for any I < m, 
h — Jii or (2) ii — ji, for 1 < / < k, and \c\ > \d\. 

We say that a polynomial is written in its normal form if it is written as a minimal 
sum of monomials, and these monomials are sorted in descending lexicographic order. 
We assume that all polynomials are written in their normal form. 

Definition 3 For two multivariate polynomials P(xi, ... ,Xk) and Q(xi, . . . ,Xk), P(x\, 
. . . ,Xk) >- Q{xi, . . . ,Xk) if, when written in their normal forms, there exists some i > 1, 
such that (the i-th monomial in P) >- (the i-th monomial in Q), and all the monomials 
preceding it are identical in both P and Q. 

Given a polynomial P(xi, . . . , x*), let the leading monomial of P be the first monomial 
in the normal form of P(a?i, . . . , Xk). Let the leading coefficient of P(x u ...,Xk) be the 
coefficient of this monomial. Define 



sign(a) = < 



+ 1 for a > 

for a = 

-1 for a < 



The sign of a polynomial P, denoted by sign(P), is the sign of the leading coefficient of 
P. Note that sign(P) = if and only if P is the zero polynomial. A sign of a rational 
expression R(x) = ^jfj, is sign(P) sign(Q). 
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To complete the proof we show that if b > tt 2 (P) = ^f± then (1 - J?k=i SJ) > 0, or 
equivalently, J2T=\ ^ < 1- This later inequality follows from 

f M<^f /-^ M y f L \ k M L l 

h Lbk ~ L h < L^Ul + L) < LM + Ll-jfe- ■ 

Similarly, we can prove the Lemma for the case L < 0. □ 

From Lemma 2.3 we can infer a sufficient condition for univariate polynomials. 

Corollary 2.4 For each polynomial P{y), there exist a positive integer ir(P) such 
that for all h > w(P), sign{P{b)) equals sign(P). Furthermore, tt{P) < max -^f(P) + ^ 
where L is the leading coefficient of P. 

Corollary 2.4 enables to argue along the following lines. Consider a comparison be- 
tween Q(x) and P(x), where both Q and P are polynomials. By Corollary 2.4, there 
exists a value e = n(P-Q), such that for x > e, either P(x)-Q(x) = 0, P(x)-Q(x) > 0, 
or P(x) — Q{x) < 0. This means that, for x > e, the value of the comparison between Q 
and P is fixed. In a similar way, Lemma 2.3 allows us to argue about the comparison of 
bivariate polynomials. 



Polynomial division 

The lemma below shows how the degree and the maximum coefficients of bivariate poly- 
nomials are affected, when we perform polynomial division. Note that the following 
lemma is restricted to the case that the leading monomial has only one variable. 

Lemma 2.5 Let P(x,y) and Q(x,y) be two bivariate polynomials with integer coeffi- 
cients. Let max-coef(P) = M, max-coef[Q) = N, and 8 = max{-l, deg x (P) - deg x {Q)}, 
If Lx d is the leading monomial of Q(x,y), where d > 1, then there exists A(x,y) and 
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Therefore we can bound the maximum coefficient of R, using the inductive hypothesis, 
as follows, 

max-coef{R) < (2 + deg y (Q)) s max-coef[S)N s 

< (2 + deg y (Q)) s ((2 + deg y (Q))MN) N s 

< (2+deg y (Q)) s+1 MN s+1 . 

Furthermore, deg x (R) < deg x (Q){ = d}, and deg y (R) < deg y {S) + (6 - l)deg y {Q) 
<deg y (P) + S'deg y (Q). 

In a similar way the bounds on max-coef[A) and deg(A) follow. □ 

For most of our applications the following simplified version of Lemma 2.5 would be 
sufficient. 

Corollary 2.6 Let P(x, y) and Q(x,y) be two bivariate polynomials such that deg(Q), 
deg(P) < D, and max-coef(P), max-coef(Q) < M. If Lx d is the leading monomial of of 
Q(x,y), then there exists A(x,y) and R(x,y) such that 

where 8 = max{-l, deg x (P) - deg x (Q)} < D. Furthermore, A(x,y) and R(x,y) are 
polynomials with integer coefficients, that satisfy, 

1. max-coef[R) < ((D + 2)M) D+2 , 

2. max-coef(A) < ((D + 2)M) D+1 , 

3. deg x (R) < D, and deg y {R) < (D + \)D. 

4. deg x (A) < D, and deg y (A) < D. 



Chapter 3 



One Variable Functions 



3.1 Overview 

Proving that a polynomial cannot compute a certain boolean function, using only a 
certain number of { + ,—,*} operations, can be clone by arguing about the degree of 
the polynomial. In order to make the case more interesting, assume that we relax the 
requirement on the output of the polynomial such that it evaluates to a non-negative 
number if and only if the computed function is TRUE on that input. On the one hand, 
the number of alternations between TRUE and FALSE of the output of the computed 
function gives a lower bound on the degree of the polynomial. On the other hand, in order 
to evaluate a polynomial of degree d, one must perform at least log d multiplications. 
This degree argument is independent of the size of the coefficients of the polynomial, 
and applies even if the input is restricted to be integer. The degree argument can be 
extended also to rational functions, and to computation tree with rational operations 
(i.e. { + ,—,*,/}). Stockmeyer [Sto76] showed, using a similar argument, an fl(n) lower 
bound for the depth of any computation tree that computes the parity, for any n bit 
integer, using rational operations. 

27 
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Doing it in a straightforward way may cause a "huge" increase in the degree. 

For example, consider the expression | , where x is an n bit integer. If R is a 
rational function such that R(x) = § , then R has degree 2 n_1 . We are interested in a 
transformation that keeps the degree and the coefficients "small". In this specific case, 
a good solution would be to set | = 0, and restrict x to integers that are divisible by 
2, i.e. only even integers. 

The general solution has a similar flavor. We restrict the input, so we can set the 
value of the floor operation equal to a rational function. In this transformation, the 
degree remains the same, and the size of the coefficients may grow at most exponentially 
in the degree. Based on this technique, we show how to derive lower bounds for a decision 
tree. 

In Section 3.3 we show how to derive lower bound for various functions, using the 
general technique. Although our proof technique is for decision problem we can use it 
to derive lower bounds for computation problems as well. In the cases that we consider, 
the decision problem is the parity of the output of the computation problem. Since the 
floor is a basic operation, we can compute the parity from the output in 0(1). This 
implies that for such decision problems, the computation problem is at least as hard as 
the decision problem. 



3.2 Proof Technique 

We start by defining the subsets of the input that will be used in our proof. 



Definition 4 Let the set S(n,\) be the set of all n-bit integers that are multiples of 
an integer A. 
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The interesting case of the proof technique is the floor operation. This is by far more 
involved than the previous cases. After proving this lemma, we show how to combine 
the techniques we developed to derive a lower bound for decision trees. Recall that we 
restrict the floor operation to non-negative inputs. 

Lemma 3.4 Let P be a rational expressions with integer coefficients, of degree at 
most D and maximum coefficient at most M , defined over S(n, A), such that P(x) > 
for any x G S(n, A). There there exists -k < (2M) D+2 , and a rational expression Q with 
integer coefficients, where deg(Q) < D and max-coef[Q) < (2M) D+1 , such that for any 

xeS(n,\n), Q(x)= [P(x)\. 

Proof: If sign(P) = 0, i.e. P is the zero function, then let Q(x) = 0/1 and 7r = 1, 
and the lemma follows. Otherwise, let P(x) = ^|||. Without loss of generality we may 
assume that sign(P 2 ) = +1. We would like to restrict our attention to a subset of the 
inputs, for which the sign of P(x) is fixed. By Corollary 2.4 for x > max{7r(Pi),7r(P 2 )}, 
sign(P(x)) equals sign(P), which equals sign(P 1 )sign(P 2 ). Since the operand of the floor 
is non-negative, sign(P) = +1, which implies that sign(Px) = +1. 

We consider three cases of relations between Pi and P 2 . 

Case 1: P x (x) = P 2 {x). This is the trivial case. Let Q{x) = 1/1, and the lemma follows. 

Case 2: P t (x) -< P 2 (x). Let B(x) = P 2 (x) - P^x). Since the leading coefficient of P 2 is 
positive then sign(B) = +1. Corollary 2.4 guarantees the existence of a positive integer 
tv(B) such that B(a) > 0, for all a > max{ 7 r(P 1 ),7r( J P 2 ),7r(5)} = tt. Since P 2 (x) > 0, 
the expression P 2 (x) - P a (x) > is equivalent to 1 > g||J = P(x). Since P(x) > 0, 
for x e 5(n,A), 1 > gg| > 0, for a G 5(n,A7r). We conclude that for any integer 
a G S(n,Xir), [P(a)\ = 0. Let Q{x) = 0/1. Since tt < 2{D + 1)M 2 + 1, the lemma 
follows. 
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We need to bound the parameters tt, deg(Q) and max-coef(Q). To bound deg(Q) 
and max-coeJ{Q) it is sufficient to consider the polynomial A(x) + cL d . Clearly, 
deg(A(x) + cL d ) < D. The value of max-coef(Q) is the maximum between L d and 
max-coef(A(x) + cL d ). By definition, max-coef{A(x) + cL d ) < max-coef[A) + L d . 
Using Corollary 2.7, max-coef(A) < 2 D M D+1 . Therefore max-coef(Q) < (2M) D+1 . 

To bound the value of tt, we bound tt{B); the bounds for tt{^P 2 + R), ^(Pi), 
and ir(P 2 ) are smaller, and derived similarly. Recall that L d B{x) = L d P 2 (x) — 
(«fP 2 {x) + R(x)). Therefore, max-coeJ(L d B) < {L d -f) max-coef(P 2 ) + max-coef(R) 
< M D+1 M + 2 D+l M D+2 . Since L > 0, for any x, sign(B(x)) = sign{L d B{x)). 
Hence, in order to bound ir(B) it is sufficient to bound Tr(L d B). Since L d B(x) is a 
polynomial with integer coefficients, its leading coefficient is at least one. Therefore, 
by Corollary 2.4, ir(L d B) < max-coeJ[L d B) +1. By requiring that tt is divisible by 
L d , we may add L d - 1 to ir(B), therefore, tt < n(L d B) + l + (L d -l) < (2M) D+2 . 

Subcase 2: 7 = 0. Clearly R(x) -< L d P 2 (x). Using Corollary 2.4, let tt be the 
minimum multiple of L d such that tt > max {tt(R), n(Pi), tt(P 2 ), ix{L d P 2 ± R)}. 
Then, for all a > tt, jgjfa = 0, if szgn(R) = 0, < j$fa < 1, if sign(R) = +1, 
and _1 < l*fZU < °' if si 9 n ( R ) = - 1 - Recall that the free term of A is L d c + 7. 
Let c be c if sign(R) > 0, and c - 1 if sign(R) = -1. We conclude that for each 
a e S(n, Att), [ggij = ^-^(a) + c. Let 

g( z ) = A ^ + cLd = L P( x )j for x € 5(n, Att). 
We can bound the parameters 7r, deg(Q) and max-coef(Q) as in the previous subcase. 

Since one of the above relations has to hold between P t and P 2 , the lemma follows. D 

To summarize, so far we considered each operation by itself. Lemma 3.4 guarantees 
that each floor operation can be replaced by an evaluation of a rational expression. 
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3. Let Si = {F Vj ,G V] | Vj is a computation vertex, j < i}. Define D{ = deg(T,i) + 1 
and Mi = max-coc/(S,-). Then, D x < 2\ and max{A< ,- >, Af,-} < 2 2 "\ 



We show how to define A^ 1 " 1 " 1 ) and how to choose the vertex u,- +2 such that Properties 
1-3 will be satisfied by the set S(n, A* t+1 ') and the prefix of V that starts at v\, and ends 
at Vi +2 . The parameter A (t+1) will be a multiple of A^. By Claim 3.1 this implies that 
S(n, \(' +1 ') C S(n,\W), therefore, by the induction hypothesis we have that Properties 
1-3 are satisfied by the set 5(?i,A (,+1) ) and the prefix of V that starts at Ui and ends 
at v i+ i. In order to complete the proof of the lemma we need to show that (a) there 
exists an outgoing edge of v,- +1 such that for each input a G S(n, A( i+1 )) the computation 
follows this edge, and (b) Properties 2-3 are satisfied also for the vertex u,- +1 and the set 
S , (n,A(*'+ 1 )). 

By the definition of the tree T, the vertex u,- +1 is either an input vertex, a comparison 
vertex or a computation vertex. For an input vertex both (a) and (b) hold trivially. 

For a comparison vertex we need to show that (a) holds. Let the comparison be 
g > h, such that g, h <E {0,1} U {f V} | Vj is a computation vertex, j < i}. By the 
induction hypothesis both h and g can be represented as rational expressions of degree 
at most D{ < 2\ and maximum coefficient M,- < 2 2 ' . By Lemma 3.2, there is a 
A(«+i) < 2A<*'>(A + l)M? + \M < 2 24( ' +1)2 . For a comparison vertex (b) holds trivially. 

For a computation vertex (a) holds trivially, so we need to show only that (b) holds. At 
a computation vertex, either f Vi+1 = goh, for o e {+, -, *, /}, or f Vi+1 = [g\ is evaluated, 
where, g, h e {0,1} U {f v . \ Vj is a computation vertex, ;' < i}. If o 6 {+,-,*,/}, 
then the induction step follows from Lemma 3.3. If f Vi+1 = [g\, then, by Lemma 3.4, 
there is a rational expression Q, such that Q — [g\. By Lemma 3.4, A+i < A < 2', 
M i+1 < (2M,) D > +1 < 2( 1 + 24,2 )(2'+D < 2 24(,+1)2 , an d A^ 1 ) < A(')(2M,) D ' +2 < 2 24( ' +1)2 . D 
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The following corollary states that for M(n) = 0(2 nE ), the lower bound is still Q,(^/\ogn). 
This corollary is used later to derive an Q.(y/\ogn) lower bound for various functions. 

Corollary 3.7 Let f be an M(n) -invariant function, such that M(n) = Q(2 n ') for 
some fixed e > 0. Then any computation tree with OP = {+,—,*,/, |_ - J}, that computes 
f(x), for all n-bit integers, must have depth f2( v / Iog~n). 

Using Corollary 3.7 we show lower bounds for the following problems. 

1. Decide if [log a\ is odd or even, for any n-bit integer a. (choose M(n) = 2 n_1 , 
a x — A and a 2 = 2A.) 

2. Decide if [log log a\ is odd or even, for any n-bit integer a. (choose M(n) = 2 n/2 , 
ai = X and a? = A 2 .) 

3. Decide if y/a is an integer, (choose M(n) = 2 n/2-1 , a Y = A 2 and a 2 = 2A 2 .) 

Theorem 3.6 gives an Cl(y/logn) lower bound on the depth of any decision tree with 
OP = {+, — , *, /, [-J }, that solves the above problems. 

At first sight it seems that all of the above lower bounds are for decision problems, 
and may be unrelated to the similar computation problems. In general this may be the 
case, but our examples have the property that the decision problem is closely related to 
the corresponding computational problem. For all the above decision problems, given a 
solution to the corresponding computation problem, one can decide the decision problem 
in 0(1) operations. 



Chapter 4 



Lower bound for GCD 



4.1 Overview 



In this chapter we extend the proof technique developed in Chapter 3 to functions with 
two variables and apply it to show a lower bound for computing the greatest common 
divisor of two integers. Although this chapter does not depend on chapter 3, it is advisable 
that the reader would read chapter 3 prior to reading this chapter. 

Before explaining the lower bound technique for functions of two variables, we argue 
why the technique for one variable does not extend immediately to two variables. The 
most obvious reduction would be to concatenate the two inputs, each of size n bits, to 
one input of size 2n bits, and define the function with respect to this single variable. In 
order for the reduction to work, we need to argue that given this input, we first extract 
the two n bit integers, and continue the computation using them. The problem is that 
in order to extract the two integers one may be required to perform fi(log n) operations. 
This f2(log n) additive factor would be more than the lower bound that we can prove for 
the one variable function. 

39 
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4.2 Inputs structure 

We start by defining the subsets of inputs that we will consider in the proof. Recall that 
for the one variable case, the subsets of the input that we considered were all the integers 
divisible by some integer. Unfortunately, the structure of the subsets of the inputs is 
substantially more complex in the two variable case. 



Definition 6 Let r, a 2 and a 3 be non-negative integers, and ax a positive integer. Let 
A = (6-1,62, ... , 6 r ) and A = (A l5 A 2 , . . . , A r ) be r -dimensional vectors of positive integers. 
For positive integers u 0) i*i, u T , and u r+1) the pair (u ,«i) is < r,a x ,cx 2 ,a 3 , A, A >- 
generated by the pair (u r ,u r+1 ) if there exist positive integers u 2 , u 3 , . . . , u r _! such that: 

u = Xi(u 1 ) h +u 2 , (4.1) 

m = \ 2 {u 2 ) 62 + u 3 , 

«« = ^i+i(ui+i) S ' +1 + u i+2 , 

"r-l = \ r (u r ) 6r + U T + 1 , 

u T > (u r +i) a \ (4.2) 

u r+1 > a 2 , 

u r = u r+1 = 1 (mod a 3 ). (4.3) 

In this case, (u r ,u T+1 ) is the < r, a u a 2 , a 3 , A, A > - generator o/(u ,«i), and u 2 ,u 3 , 
. . . ,u r+1 is the < r, ot\,ct 2 , a 3 , A, A re generating sequence for (u ,Ui). 
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The following lemma shows that when we add another variable, and thus adding 
another element to A and A, we restrict the "allowable" inputs to be a subset of the 
previous "allowable" inputs. 

Lemma 4.2 (The Containment Property): Let A' and A' be (r + 1) -dimensional 
vectors of positive integers, obtained from r-dimensional vectors A and A by appending 
positive integers 8 and X, respectively. Then, S(r+l,ati, a 2 , a 3 , A', A') C S(r, a l5 a 2 , a 3 , A, 
A), provided 8 > cx x and A = (mod a 3 ). 

Proof: Suppose that (a , a x ) G S(r -f 1, a^,a 2 , a 3 , A', A'), and let a 2 > a 3 > . . . > a T > 
a r+1 > a r+2 be its generating sequence. By definition, a r = Aa* +1 + a T + 2 . Therefore, 
a r = 1 (mod q 3 ), and a r > a"| x . In addition, a r+i > a"\_ 2 > a 2- Hence, (a ,ai) G 
S(r,a 1 ,a 2 ,a 3 ,A,A). D 



4.3 The proof technique 

As in the proof technique for one variable, we start by showing how to handle a compar- 
ison. We show that we can restrict the inputs to the comparison, so as to ensure that 
the value of the comparison is fixed. 

Lemma 4.3 Let P(x,y) and Q(x,y) be two rational expressions with integer coeffi- 
cients, of degree at most D and maximum coefficient at most M, defined over 5(0, c*i, a 2 , 
a 3 ). There exists ai < a[ < max{a x , 2D + 1}, and a 2 < a' 2 < max{a 2 ,2(£> + l) 2 M 2 + l}, 
such that for all (a,b) G 5(0, a[, a' 2 , a 3 ), the comparison between P(a,b) and Q(a,b) is 
determined (i.e. either P{a,b) = Q(a,b), P{a,b) > Q{a,b), or P(a,b) < Q(a,b), for 
(a,6)e5(0,al,Q' 2 ,a 3 )J, and 5(0, a\, a' 2 , a 3 ) C 5(0, a u a 2 , a 3 ). 

Proof: If P(x,y) = Q(x,y), for (x,y) G 5(0,ai,a 2 ,a 3 ), then let a[ = a x and q 2 = a 2 , 
and the lemma follows. 
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1. There is a rational expression Q{x,y) with integer coefficients, and integers a[, a 2 , 
TV such that 

[P(x,y)\=Q(x,y) 

for each (x,y) G S(0,a[,a' 2 , 7ra 3 ). Furthermore, max{a' 1? deg(Q), } < AD A , and 
ma,x{7ra 3 ,a' 2 , max-coef(Q),} < 2(4D 2 M) 2D \ or 

2. There is a rational expression Q(y,z) with integer coefficients, and integers a[, a' 2 , 
it, A and 6 such that 

[P(x,y)\=Q(y,z) 

where (y,z) is the generator of (x,y) G 5(1, a[, a' 2 , 7ra 3 , {6}, {A}). Furthermore, 
max{a' l5 deg(Q), 8} < 4D 4 , and max{Tra 3 , a 2 , max-coef(Q), \} < 2(AD 2 M) 2E>3 . 

Proof: Let d x — deg x (P 2 ), d y = deg y (P 2 ), and let Lx dx y' be the leading monomial of 
P 2 (x,y). Without loss of generality we may assume that L > 0, i.e. sign(P 2 ) = +1. 

In order to simplify the proof, we would like to fix the inputs such that sign(P(x, y)) — 
sign(P). By Lemma 2.3, there are two integers tti(Pi) and 7r 2 (Pi), such that for each 
(u,v) 6 5 , (0,7Ti(P 1 ),7r 2 (Pi), 1), sign(Pi(u,v)) = sign(P\). By a similar argument there 
are 7Ti(P 2 ) and 7r 2 (P 2 ) such that for each (u,v) e S(0, 7r 1 (P 2 ), ^(Pi), 1), sign(P 2 (u,v)) 
— sign(P 2 ) = +1. Since the input to the floor operation is non-negative, sign(P) = 
+ 1. Recall that sign(P) = sign(P 1 )sign(P 2 )- 1 this implies that sign(P 1 ) = +1. Let 
&i = max{a 1 ,7r 1 (P 1 ),^ 1 (P 2 )} and a 2 = max{a 2 , 7r 2 (Pi), 7r 2 (P 2 )}. Note that for each 
(u,v) € 5(0,ai,a 2 ,l), both P\{u,v) > and P 2 (u,v) > 0. Since the coefficients of Pj 
and P 2 are integers, by Lemma 2.3, o^ < D + 1 and a 2 < M + 1. 

In the remainder of the proof we consider the following five cases in the following 
order: 

1. P 1 (x,y) = P 2 (x,y). 

2. P x {x,y) ~< P 2 (x,y). 
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and 7 is a non-negative integer such that < 7 < L. Let it = L, a[ = &i, and a' 2 = a 2 . 
Then, for each (u,v) £ S(Q,a' l ,a' 2 ,Tra 3 ), 

Pi(u,v) mod L = Pi(l, 1) mod L = 7, 
Since u = v = 1 mod L. By writing the mod operation explicitly, we have, 



P 1 (u,v) 



Therefore, for (x,y) € S(0,a\,a 2 , 7ra 3 ), 



AKu) 



1 + 7 



fl(«,«) 



A(",t;)-7 



Q{x,y) 



Clearly, deg(Q) < D, max-coef[Q) < 2M, and r — L < M. 



Case 4-' The leading monomial of P 2 {x,y) is Lx dx , i.e., no power of y appears in the 
leading monomial of P 2 (x,y). We use Corollary 2.6 to divide Pi(x,y) by P 2 (x,y) as 
polynomials in x. By Corollary 2.6 there are polynomials A(x,y) and R(x,y) with 
integer coefficients such that 



Pi(x,y) = 



A(x,y)P 2 (x,y) + R(x,y) 



Note that since Case 3 does not hold, deg x {P 2 ) > 1. Furthermore, the following properties 
hold: (i) d < D, (ii) deg x (R) < deg x {P 2 ). (iii) max-coef(R) < {{D + 2)M) D+ \ (iv) 
max-coef{A) < ((D + 2)M) D+1 , (v) deg x (R) < D, and deg y {R) < (D + 1)D. (vi) 
deg x {A) < D, and deg y (A) < D. 
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Consider input (u,v) E 5(0, a\,a 2 , ia 3 ). Then, both V(u,v) < P 2 (u,v) and < 
V(u,v). Since P 2 (u,v) > 0, this implies that, 

V(n,u) _ 7.P 2 (m,i;) + .R(u,t;) 
P 2 (u,i;) L d P 2 (u,v) " 

We conclude that for each (x,y) E. S(Q, ct\, a' 2 ,ira3), 

[P{x,y)\ = — d = Q{x,y). 

Clearly, deg(Q) = deg(A) < D, and max-coef(Q) < max-coef(A) + M d < ((2 + 
D)M) D+1 + M D < 2{{D + 2)M) D+2 . 



Subcase 4-2: 7 = 0. In this subcase we restrict the input, such that the value of 
7 lAp 2 (u,v) '" J * s e ither or — 1, depending on sign(R). Define R(x,y) as follows: 



R{x,y) 



R(x,y) 



if sign(R) > 



L d P 2 (x, y) + R(x, y) if sign(R) = -1 



Note that, 



R(x,y) 
[L d P 2 {x,y), 



+ P(R) 



R{x,y) 

L d P 2 (x,y), 



where p(R) = 0, if sign(R) > 0, and p(R) = -1, if s»^n(i2) = -1. The leading 
coefficients of the polynomials R(x, y) and L d P 2 (x, y) — R(x, y) are positive, assum- 
ing that sign(R) / (the case sign(R) = is trivial). Recall that x = L d . Using 
Lemma 2.3, let 



a[ =m&x{a 1 ,TT 1 (R),ir 1 (L d P 2 -R)} < D{D + 1) 

< (^ + 1) 2 , 
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We substitute x by Xy s + z. Therefore, deg(Q) < D(D + 1) = D and max-coeJ[Q) 
< X D 2 D M < 2 D M D+1 = M. By Using Case 4, we get a rational expression Q'(y, z), and 
a'j, a' 2 and ir, such that for each (x,y) 6 S(l,a' 1 ,a' 2 ,Tra3,{6},{\}), 

Q'(y,z)= [Q(y,z)\ = [P(x, y )\ 

where {y,z) is the (unique) generator of {x,y). Now substitute z — x — Xy s in Q'(y,z) 
to get Q(x,y). We compute the following bounds for <5, 

max{de$(Q),ai} < (D + l) 2 

< (£)(£> + 1) + l) 2 

< 4D 4 , 

and 

max{maa;-coey(g),a2,7ra3} < 2 ((/) + 2)M) D+2 

< 2 ((£>(£> + 1) + 2)(2 D M D+1 )) D2+2 

< 2(4£> 2 M) 2D3 . 

Since the relation between Pi and P 2 belongs to one of the five cases, this concludes 
the proof of this lemma. □ 

The following lemma combines the previous lemmas into a proof technique for the two 
variable case. This lemma shows, that given a computation tree, one can find a subset, 
S(r, an, a 2 , a 3 , A, A), such that all the inputs from this subset follow the same path. Once 
we establish this property, we can derive a lower bound for any boolean function that, 
for any such subset, has at least two elements which differ in their outcome. In the next 
section we show that deciding if two integers are relatively prime is one such function. 
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1. For each input (a, b) £ S^ the computation follows the path from the root to u,-+i; 

2. For each computation vertex v on the path from the root to the vertex u,+i, ex- 
cluding the vertex u,-+i, there is a pair of bivariate polynomials (i^(x,y), Gj,(a;, j/)) 
with integer coefficients, such that for each input (a, b) £ «S^'\ Gj,(u,u) 7^ 0, and 
ual(i/) = g^'^ , where (u,v) is the < r<*">, a[ i] , a^ , a$\ A^\ A^ >-generator of 
(a, 6); and 

3. Let S t - = {F % v (x,y),G { (x,y) \ j < i}. Define D { = max{^(S t ),4} and M, = 
max{max-coey(Si),4}. Then, ?'('' < i, max{ai , A} < 2 2 ', and max{a4 , 0:3 , M,-} 

< 2 22 . 

We construct the set 5' t+1 ' C S^ such that Property 2 is satisfied also for the vertex 
u;+i and each input (a, 6) £ £( ,+1 ). We also select an outgoing edge of u t+ i and prove 
that for each input (a, 6) £ <S( t+1 ' the computation follows this edge. It is easy to check 
that r' t+1 ' < i + 1. Therefore, in order to complete this proof, it is sufficient to show that 
the following two inequalities hold: 

(i) max{a!*' +1 \A+i} < 2 2<( ' +1) , 

(ii) max{a? +1) ,4 ,,+1) ,M- + i} <2 224( ' +1 \ 

Let us first resolve the case when u,-+i is a comparison vertex. Then by Lemma 4.3, we 
can extend V. Furthermore, a{ ,+1) < 2 A + 1 < 2 2<( ' +1) , and a { 2 i+1) < 2M 2 (A + l) 2 + 1< 
2 . Clearly, inequalities (i) and (ii) hold in this case. 

Next, consider the case when t>, +1 is a computation vertex. We divide the discussion 
to two cases. The first case is when the operation is a rational operation, and the second 
case is when the operation is floor. 

Suppose Vi+x is v o /j., where v and /j, are previous computation vertices, and £ 
{+,-,*,/}. By the induction hypothesis, val(v) = §$£ and val{fi) = |^j}, where 
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(a , ax) = ((1 + a 3 )< e+1 ^ , (1 + a 3 ) e ), and (6 , &x) = (1 + a 3 (l + a 3 ) (e+1)ai , (1 + a 3 ) e ). In 
order to see that (a ,ai) E. 5(0, ai,a 2 , a 3 ), one can verify that (1) a > a" 1 , (2) ax = 
(1 + a 3 ) e > a 2 , and a = ai = 1 mod a 3 . (A similar argument holds for (b ,bi).) Clearly, 
gcd(a , fll ) = (1+ a 3 ) e ^ 1, and gcd(6 , &i) = 1. 

In order complete this lemma, it is sufficient to prove each of the numbers a ,ai, b , 
and bi is less than 2 2t ' i+1) . Since e is the least exponent such that (1 + a 3 ) e > a 2 , 
then (1 + a 3 ) e < 2 2t . The desired upper bounds are an immediate consequence of this 
observation. O 

Intuitively, the previous lemma shows, that if we terminate with a large subset 5(-), 
there is still a pair of inputs, one of which is misclassified. 

Theorem 4.8 Any decision tree with OP = { + ,—,*,/, |/J } an d constants {0,1}, 
that decides if a and b are relatively prime, for all integers 2 n > a > b > 0, must have 
depth J7(loglog n). 

Proof: Without lost of generality we may assume that the leaves are labeled by the 
constants zero and one, and that the two first vertices are input vertices. (This can 
increase the depth of the tree by at most four.) Suppose that we are given a decision 
tree T of depth h < iloglogfa 1 / 5 ), with OP = {+,-,*,/, [-J}, that decides if a and 
b are relatively prime, for all integers 2" > a > b > 0. By Lemma 4.6 we have the 
following: (i) there is a path V from the root of T to a leaf v, and a set of inputs 
S = 5(r,a!,a 2 ,a 3 , A, A)n {(a, b) : < a, 6 < 2 n }, such that for each input (a, b) € S, 
the computation follows the path V; (ii) The leaf v is labeled by a constant (zero or 
one), (iii) a x < 2 2<h < n 1 / 5 , a 2 , a 3 < 2^ < 2 n "\ ma x-coef{^^) < 2 nU \ and 
deg( c}(x' y J) ) < nl/5 - O ur S°al is to arrive at a contradiction using Lemma 4.7. 

Towards this end, let t = n 1 / 5 - 1. We claim that each pair (u,v) <E 5(0, a l5 a 2 ,a 3 )n 
{(u,v) : 1 < u,v < 2 2i (' +1 )} generates a pair (a,b) G <S = 5(r,a l5 a 2 ,a 3 , A, A) = 
S(r, <*i, a 2 ,a 3 , A, A)D {(a, b) : < u,v < 2 n ). Recall that the first two vertices of the 
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Approximation of Real Functions 



5.1 Motivation 

It is not hard to show that the square root function cannot be computed using ratio- 
nal operations only. For example \/2 is not a rational number; therefore it cannot be 
represented by rational operations on {0,1,2}. The floor operation would not resolve 
this problem, since its output is always an integer. Still, we would like to argue about 
computations that involve such functions. 

One direction of research may be to assume that the square root function is one of our 
basic operations in the computation tree, and consider the complexity of computing other 
functions, given it as a basic operation. Another direction is to relax the requirement on 
the output of the computation tree. Instead of requiring it to be equal to the computed 
function, we can require it to "approximate" the computed function. 

The first approach, where the square root is consider as a basic operation, was in- 
vestigated in [SY76, PipSl, Ja'81]. The main complexity measure that they considered 
was the number of square root operations that have to be performed. It was shown, for 
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The norm L^ is defined as, 

ll/lloo = SUp |/(:r)| 
r€A 

It is well known that the norm of a function ||/||fc is monotonically non-decreasing in the 
parameter k, i.e. \\fl\i < ||/|| 2 < • • • < ||/||<x>- This implies that showing a lower bound 
for H/lli, implies a lower bound for any L^, including L^. 

We still need to make a connection between the norm of a function and the quality 
of an approximation. Given a certain point x, the difference between g{x) and h{x) is 
a measure of how well g approximates h at that point. This suggests that the norm of 
g — h would be a good indicator of how well g approximates h. More formally, we define 
a function errork(g, h) as follows, 

error k (g, h) = \\g - h\\ k = tf jf \g(x) - h(x)\ k dx 

The function errork(g,h) is how well g approximates h in norm Lk- A function g(x) 
approximates a function h(x), within e, in norm Lk, if error(g, h)k < e. 

The more intuitive norms are probably L^ and L\ (although norm Li plays an im- 
portant role in many applications). The norm L^ is intuitively the worst case, for 
approximation it would mean the worst case approximation, over any input. An e ap- 
proximation in norm L^ implies that for any input in A, the difference between the two 
functions is at most e. The other norm is L\, which intuitively captures the average case. 
In the case of approximation one can view it as the expected error for a randomly chosen 
input. 

In this chapter we show a technique to derive lower bounds for approximation in 
norm L x . Clearly any lower bound in norm L x , implies a lower bound in any other norm, 
including L^. 
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path, P, in the computation tree. Note that this would not be true for an integer input, 
since, for example, branching according to whether the input is odd or even, divides the 
integers to two sets, such that neither set has two successive integers. The other point is 
that we need / to be "large", otherwise we will not be able to establish a lower bound at 
the end. For example, if / < e, then the square root function can be approximated by a 
constant over such an interval. 

The proof that there exists a path P is done by constructing it inductively. The 
inductive claim uses a prefix "P, of V, that includes the first i vertices, and a subinterval 
[ai,a.i + /,■], such that the value of each vertex in P,-, for inputs from [a,i,ai + /,■], can be 
expressed as a rational expression, and any input from [ai,a,i + /,] follows P,. 

In the inductive step we show that P, can be extended to Vi+\ and maintain the 
inductive claim. Here we have two cases, depending on the type of last vertex of V{. 

If the last vertex in P, is a comparison vertex, we need to show how to choose one 
of its sons as the next vertex u,- +1 . In this case we need to show that there is a "large" 
subinterval [a,- +1 ,aj +1 + /,+i], where all the inputs continue to u,-+i. 

The other case is when the next vertex on P is a computation vertex. If the computa- 
tion vertex has a rational operation, then the inductive claim is maintained trivially. The 
interesting case is when the vertex computes the floor function. Such a vertex receives as 
an argument the value of some previous vertex on P,, which by the induction hypothesis 
can be expressed as a rational expression. Using the above argument about \r(x)\, there 
is a "large" subinterval [a, +1 ,a I+1 + l i+1 ], in which the value of this vertex is constant. 
We continue by setting the value of the vertex to that constant, and restrict the inputs 
to the subinterval [ai+i,a,+i + /;+i]. 

To summarize, the construction so far shows that for any computation tree T, defined 
over inputs x £ [1,2], there is a subinterval [a, a + /], and a rational expression r(x), such 
that T(x) = r(x) for any x e [a,a + /]. This by itself does not imply any lower bound 
but reduces the problem of proving the lower bound for T, over [1,2], to the problem of 
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lemma shows that the "penalty" of restricting the rational function to a subinterval in 
which it has no roots or poles is rather limited. 

Lemma 5.2 Given an interval [a, a + /] and a rational function r{x) of degree d, 
there exists a subinterval [a, a + jj\ C [a, a + /] that does not contain any poles or zeros 
ofr(x), and furthermore, r(x) is monotone in this subinterval. 

Proof: Let r(x) — 4^4 where p{x) and q(x) are polynomial of degree at most d. Consider 
the 6c? + 1 subintervals [a, a + 55^], [a + 53^,0 + g^-], . . . , [a + gj^.a + /]. In each 
subinterval in which r(x) has a pole, the polynomial q(x) has a zero. In the worst case, 
each zero of q(x) can be at the boundary of two adjacent subintervals. Since q(x) has 
at most d zeros, the number of subintervals that contain a pole of r(x) is at most 2d. 
Similarly, the number of intervals that contain a zero of r(x) (which is a zero of p(x)) is 
bounded by 2d. Therefore, there are k > 2d + 1 intervals that do not contain any pole 
or zero of r(x). 

Now, consider these remaining k intervals. Each interval where r(x) is not monotone 
contains either a local maxima or a local minima in its interior. At each local maxima 
or local minima, the derivative of r'(x) has a zero. Since the degree of r'(x) is at most 
2d, it can have at most 2d zeros. (Notice that the poles of r'(x) are same as the poles of 
r(x).) Hence, there exist k — 2d > 1 subintervals that do not contain any poles or zeros 
of r(x), and furthermore, r(x) is monotone in each of these subintervals. □ 

The following lemma is a classical result, known as the Markoff inequality. It relates 
the value of the derivative of a polynomial to the polynomial value and degree. 

Lemma 5.3 (Markoff [Che66, page 91]) Let p(x) be a polynomial of degree d, and 
p'{x) the derivative of p(x) with respect to x. Then 



max {\p'(x)\} < d 2 max {|p(a;)|}. 
se[-i,i] x6[-i,i] 
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on polynomials, we can also bound the value of a rational expressions. Using the above 
lemma, we can derive the following lower bound for a polynomial. 



Lemma 5.6 Given an interval [a, a + /] and a polynomial p(x) of degree d > 1 with 
integer coefficients, there exists a subinterval [a,a + g^-] C [a,a + l] such that (i) p(x) is 
monotone in this subinterval, and (ii) \p(x)\ > (^) d for all x £ [a, a + g4-]. 



Proof: Let b be the leading coefficient of p. Since d > 1 then b ^ 0, furthermore, since 
b is an integer, this implies that \b\ > 1. Assume that K is the largest value of \p(x)\ in 
the interval [a, a + /]. First, we argue that there is a subinterval of length 8 — -^ such 
that p(x) is monotone and is greater than K/2, throughout this subinterval. Then, we 
show that K/2 > (//4) d . 

Let yi = a + ^ for i = 0, 1, . . . , 8c? 3 . Consider the 8g? 3 subintervals [j/,_i, j/j], for 
i = 1,2, . . . ,8c? 3 . Choose x £ [a,a + 1} such that |p(x )| = K. Choose j such that 
^o £ [j/j, Vj+i]- Assume that j < 4c/ 3 , and consider d consecutive subintervals [yj+i, J/j+i+i] 
for i = 0, 1, ... ,d — 1. (The case j > Ad 3 is similar.) Since the polynomial p(x) has a 
total of at most d — 1 local maxima and local minima, p(x) is monotone in one of these 
d subintervals. 

Next, we show that |p(a;)| > K/2 for all x G [yj,yj+d\- For all real u such that 
x + u E [a,a + /], Lemma 5.4 implies that \p(x + u)\ > \p(x )\ - u^j-K. Therefore, for 
all x e [ yj ,y J+d ], \p(x)\ >K- ^-fl< = K{1 - J) > \K. 

It remains to show now that K/2 is larger than (^) d . In order to prove this assertion, 
transform the polynomial p(x) into a polynomial q(z) with leading coefficient one, in the 
interval [—1,1] by setting 

q(z) = t(t) p( z ~ H ~ — ), where b is the leading coefficient of p{x). 
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exists a subinterval [a,a+ j^p], such that \r(x)\ > (2M) -1 (j)~ d for all x € [a,a + ^r], 
and r(x) is monotone in this subinterval. 

The following lemma is the main technique that we use to bound the value of the 
floor function. This lemma guarantees that there is a "large" subinterval in which the 
value of the floor function is constant. This implies that if we restrict our attention only 
to this subinterval, the value of the floor operation is constant. 

Lemma 5.9 Given an interval [a, a + /], where 1 < a < 2 and Q < I < 2 — a, 
and a rational function r(x) of degree d with integer coefficients and max-coef{r) < M ; 
there exists a such that [r(x)\ equals a fixed integer J for all x G [a, a + M~ 1 (fil) d+1 ] 
C [a,a + I], where /x < 1 is a fixed constant that can be chosen to be 0.02. Moreover, 
\J\ < M(fil)- d . 

Proof: By Lemma 5.7, there exists a subinterval [/?,/?+ ^-j in which r(x) is monotone, 
and |r(x)| < 2M(f) d for all x € [/?,/?+ j^). This implies that \r(x)\ takes on at most 
4M(|) d distinct integer values in this subinterval. By the pigeon-hole principle, there is 
a subinterval of length at least 56(i4 ' 8)d where r(x) is a constant. Let J equal the value 
of r(x) in that interval. □ 

So far we were interested in the behavior of rational functions and the floor function. 
The next lemma shows a property of the s ik root function. This lemma establishes a 
connection between the degree of a rational expression, its maximum coefficient, and how 
well it approximates the s ik root. 

We should note that one can bound the approximation as a function of the degree, 
independent of the size of the maximum coefficient. The techniques used to achieve such 
bounds are more involved, and are not needed in our case. (See, for example, [Bra86].) 

Lemma 5.10 For an integer s > 2, given an interval [a, a+l] C [1,2] where 1 < a < 2 
and < / < 2 — a, and a rational function r(x) of degree d with integer coefficients and 
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2. For each computation vertex v on the path V, there is a rational function r u (x), 
with integer coefficients, such that for any input a 6 [a, a + /], the value computed 
at vertex v equals r l/ (a). 

Finally, we conclude the desired lower bound on the length of V (and hence, a lower 
bound on the depth of T) by applying the result of Lemma 5.10 to rational functions 
that approximate ^fx over the interval [a, a + /]. 

Denote the vertices on the path V from the root of T to its leaf vi by v\, v 2 , . . . , i>/, 
in that order, where v^ is the root of the tree T and u; is a child of u,-_i. We define the 
path V and the subinterval inductively, starting with the empty path and the interval 
[1,2]. As part of the induction hypothesis, we maintain three properties of the path and 
the interval under consideration. These properties are described below. 

Suppose that (a) we have selected a prefix of V, which starts at v\, and ends at a 
vertex v i+1 , and (b) defined a subinterval [a,-, cii + /,•] with the following properties: 

1. For each input x G [a,-, a,- + /;] the computation follows the path from the root to 

vi+i; 

2. For each computation vertex Vj on the path from the root to the vertex u,-+i, 
excluding the vertex u,+i, there is a rational function rj(x), with integer coefficients, 
such that val(vj) = rj(x), for any input x € [a,-, a,- + /;], 

3. Let £,• = {rj(x)\vj is a computation vertex, j < i}. Define D, = deg(Hi) + 1 and 
Mi = max-coeftEi). Then, D t < 2', M,- < ^~ 22, \ and /,- > n 2 ^ , where /i is the 
constant in the statement of Lemma 5.9. 

The basis of the induction is i — 0, and the path is v x , which is the root. All the 
claims hold trivially in this case. 

We show how to define [a, +1 , a i+1 + / 1+1 ] and how to choose the vertex v i+2 such that 
Properties 1-3 will be satisfied by the interval [a i+1 ,a i+1 + l i+l ] and the prefix of V that 
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bounded by 2(D, + l)Mf. Therefore, D i+1 < 2A, M i+1 < 2(A + l)Mf. For each 
x 6 [a, + i,a, +1 + /, +1 ] and r,- + i(x), Properties 1-3 are satisfied. 

Suppose that /„ 1+1 = /„ I — [g(x)\, for some 1 < j < i. By our hypothesis, 
deg(g) < D{ and max-coeJ[g) < Mi. By Lemma 5.9, there exists an integer J, and 
an interval [a; +1 ,a; +1 + /{+i], such that (i) for any x £ [a, +1 ,<2i +1 + /,+i], |_#( X )J = J ■> 
\J\ < M i (/i/,-)- D ' < ^- 22 %- 2 '( 1+22 ' 2 ' < M - 22(,+1)2 ; and (ii) / i+1 > ^h) D ' +1 > p** 
A/ (i+2^ 2 )(2'+i) > ^<^)\ Define ri . +1 ( x ) = J. Then, A+i = A and M i+1 = max{A/„J}, 

and properties 1-3 are satisfied. 

Thus, we have proved the existence of a path V and an interval [a, a + I] such that 
properties 1-3 are satisfied for all vertices on V. We have also established that there 
is a rational function r(x) such that for all a € [a, a + I] the value produced by T (at 
the end of the path V) on input a is given by r(a). Furthermore, that deg(r) < 2 h , 
M — max-coef[r) < p~ 2 , and / > p 2 

By Lemma 5.10 there exists a subinterval [a,a + /'], /' = 4(56j ' +2)4) = 0(^-), such 
that for any v in the subinterval, \v l l s - r(v)\ > c(M(sd + 2))- 4 (^) 2sd+3 = 6, where c is 
a constant. This implies that we can lower bound errori(T, ■tfx) as follows, 

error-i(T,^/x) > 81' 

The value of 8 is fi(M" 4 (j)°( 3(i )), and we can show 8 > cf , where c s is a constant 
that depends only on s. Since T approximates y/x, in L\, within e, then 

e>erron(r,^x) > <5/' > cf' 

Therefore, h = Q.(J\og log i). □ 



Chapter 6 



Upper Bounds 



In this chapter we show how the power of the floor function can be used to accelerate 
computations. Clearly the floor function can help in computing the floor function and 
the mod function, but the interesting fact is that it can accelerate computations which 
we normally do not associate with the floor function. 

One issue of interest is the constants that the computation tree uses. A uniform com- 
putation tree has as constants only {0, 1}, and every other constant has to be generated 
explicitly. On the other hand a non-uniform computation tree may have arbitrary initial 
constants. 

We show that many functions can be computed by non-uniform computation trees 
of depth 0(1). This justifies the reason that the lower bounds proved in the previous 
chapters are for the uniform computation trees. For uniform computation trees we show 
upper bounds that, in some cases, match the lower bound that we proved in the previous 
chapters. 

We start, in Section 6.1, by showing that any polynomial has a non-uniform computa- 
tion tree of depth 0(1) that evaluates it, and this constant does not depend on the degree 
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First we show how to construct a constant that encodes all the coefficients of a 
polynomial P(y). The constant has simply all the coefficients in order, each padded 
to m bits, where m will be chosen latter. More formally, let P(y) = ELoP«'2/'> where 
the coefficients, p t , are integers. Define c™ = P(2 m ) = T, d =Q Pi2 mi . For m, such that 
2 m > max-coef(P), it is easy to verify that the m-bit integer obtained by considering bit 
positions im to (i + l)m — 1 of c™ is equal to p,-. 



The polynomial evaluation algorithm: 

The algorithm is constructed to evaluate the inputs of a specific polynomial P in constant 
number of operations. The algorithm receives as an in-put an n-bit integer a. The 
algorithm output is the value of P(a). The polynomial evaluation algorithm consists of 
two steps. 



Step 1. Compute b = [ jJ^J . Letting 7 (a) = £Lo a, 2 m(d ~ t ' ) , we show that b = j(a). 
For | a; | < 1, the following identity holds, ^ = J2Zo x ' '• Recall that a < 2 n . We will chose 
m such that 2 n < 2 m , and therefore 2~ m a < 1. This implies that 1 _ 2 1 _ ma = E^ (2 _m a)', 
and therefore 

Omd oo oo 



1 - 2~ m a 



= ^2 md (2 _m a)' = J22 m(d - i) a i . 



:'=0 .=0 

For m, such that a d+l /2 m < 1/2, the sum E£ d+ i 2 m ( d -^a i is less than one. This, in 
combination with the fact that a is an integer, implies that 



6 = 



]T 2 m ( d -V 



t=0 



d 

i=0 



= ^ 2 m(d-,) a , + 



Y> 2 m(d - i) a i 

i=d+\ 



7 (a). 



The constants used in this step are 2 md and 2 m , and the requirement on m is that 
m > n(d+ 1) + 1. 



Step 2. Extract the ?n-bit integer obtained by considering bit positions md to m(d+l) — 1 
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6.2 Approximations 

The non-uniform case 

We show how to use the construction in the previous section, to approximate yb for any 
b £ [1,2] in constant time. The existence of such a program does not contradict the 
lower bounds of Chapter 5, because it assumes that certain large constants are available 
for "free". In other words, we show a 0(1) non-uniform upper bound on the depth of 
computation trees for approximating \fx. 

For b € [1,2], let a = [46/e 2 J. It is not difficult to see that if \y/a — a\ < 1 then 
\yb — ae/2\ < e. Below, we present an 0(1) step algorithm for computing a. 

Our algorithm is based on Newton iteration for computing square roots. Recall that 
Newton iteration for computing the square root of a is given by 

a -f- x 2 
x,+i = — - — '-, where x is the starting point. 
2.x i 

Define <!>, to be the relative error of x,-, i.e. (1 + Si)y/a — x,-. It is easy to verify that 

^ ~ 0/1 ,'~c — T ^ 2 (tt) 2 '> assuming that S < 1 
2(1 + di-i) 2 

Let x = [3/eJ, this implies that y/a < x < \y/a. Since < 6 < 1/2, this implies that 
Si < 2 • 4 -2 . Our aim is to reach an index t, such that \yfa — x t \ < 1. This occurs when 
\6 t y/a\ < 1, or equivalently \S t \ < ^. For t = O(logloga) = O(loglog(l/e)), the value 
of S t satisfies, < S t < 1/y/a. Thus, in order to compute a, it is sufficient to perform t 
Newton iterations, starting at xo- 

Notice that upon starting the Newton iterations at x , the value of x x is obtained 
by evaluating the rational function R\{y) = ^^ at y = a, where P x (y) - y + x 2 , and 
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The uniform case 

The general idea in this case is similar to the idea in the non-uniform case. Using the same 
logic as before, for any 6 € [1,2] and a = [46/e 2 J , if \y/a — a\ < 1 then \y/b — ae/2| < e. 

The main difference between the construction in the uniform case and the non-uniform 
case is the following. In the uniform case we will consider the approximation, x,, com- 
puted at the i l Newton iteration, as a function of the starting point xq, while assuming 
that a is constant. Recall that for the non-uniform case the roles were reversed; £, was 
viewed as a function of a, while xo was fixed. 

For a starting point x Q , the value of the i tk approximation, x,-, is given by evaluating 
a rational function i/,(x) = ^4|^ at the point x = x , where F\{x) — x 2 + a, G\(x) = 2x; 
and for i > 1, F,(x) = aG^ + F 2 ^, G,-(x) = 2F l _ 1 (x)G i - 1 (x). (Note that the coefficients 
of F, and Gi are integers.) 

The above identities can be used to define a straight line program, of length O(t) that 
computes H t . The rational expression H t performs t Newton iteration for a certain input 
a. This implies that H t (H t (x )) can be viewed as first performing t iterations, starting 
at x Q and ending at x t — H t (x ), and then performing t more iterations, starting at x t 
and ending at x 2t = H t (x t ); and therefore, H t (H t {x)) = H 2t (x). 

For the sake of simplicity we assume that e is a power of two 1 . Let d t = deg(H t ), 
M t = max-coej{H t ), and 2 n = ^. Initially, max-coeJ{Fi), max-coeJ{Gi) < ^ = M . From 
the definition of F; and G' t - it is clear that deg(F t ), deg(G t ) < 2*. One can verify that 
max-coef[Fi), max-coef{Gi) < (Af + 1) 22 '. Therefore, d t = 2* = d and M t < (Mo + l) 2 *' < 
iW = M. 

We choose m, such that 2 m = 2(if) 22<+1 > 2M2 2nd . Clearly 2 m and 2 md can be 
computed in 0(t) steps. Since H t has a straight line program of length 0(t), and we 



: For an arbitrary e we can use the powers of two subroutine, describe in the next section, to find an 
< e which is a power of two. The running time of this procedure would be O(wlog log -). 
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input: b, e 


computation: 


Compute 2 m , 2 dm and c% = H t (2 m ). 


Xq = 


46 




FOR i = ltok DO 


X{ = [/?<(£,■"_! )J/* invoke the polynomial evaluation procedure */ 


a = x'k - 1 


= f 


output (3. 



Figure 6-2: Approximating the square root uniformly 
would be ae/2. To conclude we proved the following theorem. 



Theorem 6.3 There exists a computation tree T(e,x), whose depth is (9(v/loglog -), 
such that for any b € [1,2], 

\T(e,b)-Vb\<e 

Note that the algorithm receives two inputs, e and 6, and outputs T(e,b). 

Approximating the square root of an integer is closely related to deciding if an integer 
is a perfect square (i.e. whether an integer has an integral square root). We could decide 
if an n bit integer is a perfect square by first approximating its square root within one, 
and then checking the integers within one of that value. Note that in the approximation 
procedure above, a approximates y/a within one, where a is an integer. The problem 
in extending the result to deciding perfect squares is that we need to compute a "good" 
initial point, (i.e. such that < S Q < 1/2) and a bound on M . For the approximation 
case, we chose the value of a such that it would be easy to compute a "good" initial 
point for it. The following lemma states that given a "good" initial point, and a constant 
2 > a, we can decide the perfect square problem in 0(-y/logloga). 

Lemma 6.4 There exists a computation tree T(x, x , 2 ! ), whose depth is 0(>/loglogx), 
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computing 6, = a k mod (a — i;_i), we can raise b to the ik power in 0(i) steps, as long 
as b ik < a - b^-^ k . 

2 2 

For a given an integer a, we show how to compute 2 2 ' for all i, such that 2 2 < a. 
The computation is done in i steps. Starting with a\ — (2a) 2 and b\ = 2 2 we compute: 
ai = a\_ x and &,• = a, mod (2a — 6,_i). (Recall that the floor and the mod operations are 
computationally equivalent, i.e. mod can be computed in 0(1) operations using the floor 
operation.) 

We prove the correctness of the computation by induction. It is easy to see that 
a, = (2a) 2 . Recall that x — y divides x k — y k , for any positive integer k, therefore 
2a - &,-_! divides (2a) fc - of_ x , for k = 2 2 '" 1 . Thus, 

bi = a t mod (2a — i;_i) 

= (2a) 22 ' -1 - ((2a) 22 ' -1 - 6 2 !; -1 ) mod (2a - b^) 
= bf^' 1 mod (2a-bi-i). 



Observe that b}2 x = 2 2 ' . We get that if bf^ < 2a - b^ then b { = 2 2 ' . By the 
induction hypothesis 6 t _i = 2 2< ' _1) < a. Thus, if 2 2 ' < a, then 2 2 ' < 2a - &,-_!. Clearly 
the computation is done in 0(i) steps. 

2 

Seemingly, the above procedure gives us a way to compute 2 2 ' for i — 1,..., 
[\/log log a J . However, note that y^log log a\ is not known in advance. Thus, we are left 
with the following problem: How can we identify when to stop? That is, how to find the 

2 2 

first i for which 2 2 ' > a and thus 6,- ^ 2 2 ' . We cannot spend the time testing whether 
b{ = 6 2 _! by successive squaring, for each i, since it requires too many steps. 

2 

The solution is to compute another variable c?,-, such that d, = 2 2 ' , for i = 1, ...,/ + 1, 
where / is the greatest integer such that 6, = 2 2 ' . This implies that d, = b { for i < I and 
di+i 7^ fy+i- This allows us to detect the termination by simply comparing bi to d,- and 
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Applications 

We can now show a few applications of the powers of two subroutine. The first application 
is computing l>/logloga . Let t be the last iteration in which b t = d t . Then, 



V log log a 



= < 



t - 1 if b t > a 
t if b t < a 



Thus, we can compute I v/logloga in O(y/\ogn) time, for all n-bit integers. 

We extend the result of computing >/log log x and show how to compute [log log xj . 

2 

Using the powers of two procedure we can compute in O(t) steps, the number 2 2 , such 
that 2 2 < a < 2 2 . The idea is that in 0(t) additional steps, we can find a j such 

that 2 2J < a < 2 2J+1 . 

2 

This is done by simply squaring 2 2 successively, until the first time it is larger than 
a. Note that since (t + l) 2 — t 2 = 2t — 1, at most 2t — 1 successive squaring will be 
performed. We conclude that given an n-bit input a we can compute j = [log log aj 
in 0(v/logloga) = O( v /logn) time. In Chapter 3 we proved an fi(y / Iog"n) time lower 
bound for the respective decision problem. Therefore we have proved the following tight 
bounds. 

Theorem 6.5 There is a decision tree with OP — {+,—,*,/, |_-J } and constants 
{0,1} that computes [log log a\, for all n-bit integers a, and has depth 0(y/logn). Fur- 
thermore, any decision tree for this problem has depth n(y/\ogn). 

We can also use the powers of two subroutine to show that for infinitely many n, there 
is a decision tree, of depth (9(x/logn), that decides if an n-bit integer is a perfect square. 
In this case we restrict the integer input to be from the interval [2 n_1 , 2"). Suppose that 
n = 2' 2+1 . In 0(y/logn) time we can compute 2 2 '' = 2 n/2 , and 2 2(t+1) ' > 2 n , starting 
from the constants zero and one, and the input. Using Lemma 6.4, with the initial point 
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Conclusions 



In this Chapter we give a brief overview of the results that were presented in this thesis. 
We also mention a few open problems and possible directions for future research. 

In Chapter 3 we developed a lower bound technique for decision tree with operations 
{ + >—,*,/, [•}}, This lower bound technique could be used to prove tt(s/\ogn) lower 
bounds for problems, such as deciding perfect squares, computing (JoglogzJ, and other 
problems. 

As a historical remark, it is interesting that when the lower bounds were proven, there 
was a gap between the lower bound of Q(y/logn) and the upper bounds of O(logn). At 
that time it was believed that the gap could be closed by improving the lower bounds. 
Only later, when considering closely the lower bound argument to see where it could be 
tightened, did the technique of achieving the upper bound emerge. This, and the other 
upper bounds in Chapter 6, show that the floor operation can add significant power to 
the computation. 

In Chapter 4 we have proved an f)(log log n) lower bound on the depth of any compu- 
tation tree with operations from the set {+, -,*,/, |_-J }, that decides if two n-bit integers 
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are relatively prime. An upper bound for this problem can be obtained from the parallel 
algorithms of [CG90, KMR84]. Analyzing those algorithms, with respect to the number 
of arithmetic operations, reveals an upper bound of Oinj logra) for computing the gcd. 

Notice that the gcd can be written as an Integer Linear Program. Therefore, one 
of the consequences of our results is that there is no algorithm for the Integer Linear 
Programming problem, using operations only from the set { + ,—,*,/, |_- J } , whose running 
time depends only on the number of variables and the number of constraints, and not on 
the size of the coefficients. 

In Chapter 5 we prove lower bounds for approximating the s ih root of a real number. 
We show that approximating the s ih root, such that the average error is less than e, 



requires at least O(wloglog ^) operations. On the other hand, in Chapter 6, we prove 
an O^loglog i) upper bound, when the computation receives both the number to ap- 
proximate and the accuracy parameter e. This procedure approximates within e in the 
worst case (i.e. for each input the approximated value is within e of the correct value). 
Bashouty [Bas90b] has shown an fl(loglog i) lower bound for worst case approximation. 
It still remains as an open problem the complexity of the average approximation for the 
s th root. 

We know that if the intermediate results in the computation can be exponential in the 
size of the input then any PSPACE computation can be done in a polynomial number of 
operations (see [BMS81]). On the other hand, if the intermediate results are restricted 
to be polynomial in the size of the inputs, then any computation with a polynomial 
number of operations is in polynomial time. One very interesting direction of research 
would be to show tradeoffs between the size of the intermediate results and the number 
of operations required. 
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CHAPTER 6. UPPER BOUNDS 



input: a 




initialization: b\ — 2 2 , a\ — (2a) 2 , c\ 


= (2a) 4 , di = 2 2 , and i = 1. 


computation: 




WHILE b t = d { DO 




i = i+ 1; 




a, = a-_j; 


6, = a, mod (2a — 6,-_i) 


C ! — C t— 1) 


d, = Ci mod (a,- — d,_i) 


END 




output d{ /* di = 2 2 ' 2 */ 





Figure 6-3: Computing powers of two 



terminating when they are not equal. It remains to show how to construct d,-. 

In addition to the computation of a,- and &,-, we compute Cj and d,, where: ci = (2a) 4 , 
d\ = 2 2 , c^ = c\ti and d,- = c, mod (a, — d t _i). It is easy to see that c,- = (2a) 2 '" . Notice 
that c, = of ' . Suppose that 6,_i = d,_x = 2 2 ' _1 , and consider the i th iteration: 



di = d { _ x mod (a,- — o?,--!). 



This implies that if d 2 ^ < a< - d,_x then d; = 2 2 ' . Clearly, if df\ < 2a - 6,_i and 
6i_i = d,_!, then d 2 ^ < a, — d,_x. We conclude that for all i such that 2 2 ' < 2a — 6,_i, 

2 -2 2 

bi = d,- = 2 2 ' . Consider the least i such that 2 2 ' > 2a — 6,_!. Clearly, for this i, bi ^ 2 2 ' . 
However, we claim that d,- is still equal 2 2 ' . This is true since 2 2( ' -1) < 2a — 6,-_ x , 
implying that 



2 r = (2 2(-) f<-> < {2a _ 6j _ i)2 - < (2fl)a «- _ £•- = fl . _ ^ 



Thus, the /?r\s£ time 6, 7^ d,, we stop the computation. Let t be the last index such 
that b t = d t , this guarantees that b t — 2 2 ' , and a < 2 2( ' +1) (Figure 6-3 gives a full 
description of our algorithm for computing powers.) 
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such that if y/x < x Q < \\/x and 2 l > x, then 

\T{x,x ,2 l )-^\<\. 

Proof: Let n = /, d, — deg(Hi) and M, = max-coef{Hi). We show how to compute 
2 m > 2M t 2 2nd < in 0(t) steps. Once we show this, the rest of the algorithm and the proof 
is identical to the approximation case. Recall that M t < (M + l) 2 ' . Since x < 2', 
then M < 2', which implies that M t < (2' +1 ) 22 ' = M. Since d t = 2 l and 2 n < 2 l , 
we can compute 2 m = 2(2') 22 ' +1 > 2M2 2d ' n in 0{t) steps. For t - O(y/\og\ogx), the 
above computation time is 0(v/loglog x). Therefore the total running time of the entire 



procedure is 0{\J\og\ogx). D 

Later we show how to use the above lemma to prove that for infinitely many n, for 
any n bit integer, we can compute a "good" initial point in 0(i/logn) steps. 



6.3 Computing powers of two 

2 

Suppose that we want to compute 2 2 ' , for some i. Intuitively, it seems that the fastest 
way to do it is by successive squaring. This gives an 0(i 2 ) step procedure for computing 
this number. Surprisingly, using the floor operation, and given a large enough number, 
we can compute this power in 0(i) steps. 

The main idea is based on the following observation. Let a, b and k be positive 
integers, and a > b. Then, 

a k mod (a - b) = ba k ~ l mod (a - b) = b k mod (a - b). 

If b k < a — b then the value of the above computation is simply b k . That suggests that 
once we raised a to the k th power, we can raise b to the k th power in 0(1) operations. By 
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already computed 2 m , we can compute c# t = H t (2 m ) in 0(t) steps. To summarize, so 
far we have computed 2 m , 2 md and c]} t in 0(t) steps. Thus, we have computed all the 
constants necessary for Theorem 6.1, and therefore an evaluation of H t can be performed 
in 0(1) operations. 

The computation evaluates H t k times; the input to the i ih evaluation of H t , x,-, is 
the output of the {i-l) th computation, i.e. x; = if t (x,_i). Each evaluation of H t costs 
only 0(1), therefore the length of the program is 0(t + k), and it performs tk Newton 
iterations. This implies that it can perform (9(loglog ^) iterations in 0(J\og\og j) steps 
(by choosing t = k). 

However, we are still left with a slight implementation problem. Notice that x,, for 
i > 0, may be non-integer, while the polynomial evaluation procedure is valid only if x,- 
is an integer. (Another condition is that x, < 2 n , but this is always true.) To fix this 
problem, suppose that in each iteration instead of computing x,-, we compute the smallest 
integer greater than Xj. Formally, define the modified iteration as follows. 



'«+i 



' Ft(xi) 
G t (xi) 



We claim that this does not change the convergence rate by more than a constant factor. 
Let x, = (1 + 8i)y/a. It can be verified that, 

|^ +1 |<2(!) 2, + -L 
2 V" 

As long as <!>,■ > -^, this implies that 2(^y L ) 2 ' > -4-, and therefore, 

Si < 4(%if < C 



On the other hand, if < £,• < 4^, then yja < x,- < y/a + 2. This implies that 
|-\/^ — (^i '■ ~ 1)1 ^ 1- Therefore, we can set a = i^ — 1, and the approximation to yb 



78 



CHAPTER 6. UPPER B UNDS 



input: 
computation 


b,e 

: 


N 


>| 










output ft. 


a = 

ft = 


Rt 

at 
2 


J 

(a)/* 


invoke the 


polynomial evaluation 


procedure 


V 



Figure 6-1: Approximating the square root non-uniformly 



Qi(y) = 2xq. In general, Xi is obtained by evaluating a rational function 



Riiv) 



^i{y> 


* ' \Qi-i(v)> 


yW-i(y) + rf-i{y) 


Qi(y) 


z Q.-i(y) 


2P i „ l (y)Q i _ 1 (y) 



at the point y = a. This implies that P { (y) = yQti(y) + P?-x{y) and Q,-(y) = 2P,_ 1 (y) 
Qi-i(y), for i > 1. (Note that the coefficients of P { and Qi are integers.) We are interested 
in computing a = R t (a) - P t (a)/Q t (a). Note that given x and t the polynomials P t and 
Qt are defined. Furthermore, x Q depends only on e, and not on a. Since, the coefficients 
of P { and Qi are integers, by Theorem 6.1, we can compute P t {a) and Q t (a) in constant 
time, for any n-bit integer a. Hence, we can compute a — R t (a) in constant time. 

Let (3 — ae/2. Since a can be computed in 0(1) steps, ft can be computed in 0(1) 
steps. As we discussed before, \y/b - ft\ < e, therefore we have shown the following 
theorem. 



Theorem 6.2 There exists a constant C such that, for any < e < 1 there is a 
straight line program L t of length C , such that for any h G [1,2], 



\Vb-L t (b)\ <e 
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of 7(a)c™. More formally, compute 

(3 = [~f(a)c™2- md \ - [y(a)c™2- md - m \ 2 m . 

We claim that P(a) = 0. Consider the product 7(0)0™, then 

7(«)< = EEp' fl, ' 2m((i " i)+i) 

! = j = 

d min{d,d-\-ky 

= E ( E P,-a , '"*)2 m( * + '' ) 

k——d i=max{0,A:} 

= E ^^ 

fc=-d 



Note that P(a) = J2 d =0 Pia l — g . For ?n, such that 2 m > max,^}, the m-bit integer 
obtained by considering bit positions md to m(d + 1) — 1 of 7(a)c£ l is <7o = 0- This 
concludes the description of the polynomial evaluation algorithm. Below we bound the 
value of m that meets the above requirements. 

Let, max-coef(P) = M. Since a < 2", then the value of a*' is bounded by 2 n *. Thus, 
each g k is bounded by dM2 nd . In step 1 we required that m > n(d + 1) + 1, therefore it 
is sufficient to chose m > 2nd + [log M] + 1. (Note that this implies that the size of the 
constants used are polynomial in d, n and logM.) 

Recall that a straight line program is a computation tree without any comparison 
nodes. Thus, we have shown the following theorem. 

Theorem 6.1 There exists a fixed constant C such that, for any polynomial P with 
integer coefficients, there is a straight line program L of length C, that computes P(x) 
for all n-bit positive integers. If deg(P) = d and max-coef(P) = M, then the constants 
used in L are 2 m , 2 md , and c™ , for any m > 2nd + flog Af\+1. 
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of the polynomial. In section 6.2 we apply the techniques developed for evaluation of 
polynomials to the problem of approximation of square roots. For the non-uniform case, 
we show a straight line program that approximates the square root in 0(1) steps. For the 
uniform case, we show a computation tree T that, on input e > and a; € [1, 2] computes 



T{t,x) such that \y/x — T(e,x)\ < e, and the has length 0(\J\og\og i). In contrast, it 
is known that using only rational operation, O(loglog-) steps are both necessary and 
sufficient (see [Bra86]). 

Note that when we are considering approximation of square roots, we are considering 
two classes of algorithms. The first class is algorithms that receive only the input, x, and 
although they have a guarantee that the output is at most e from the square root, they 
do not use the value of e in the computation. An example of such an algorithm is the 
Newton iteration method; the lower bound that we show is for this class of algorithms. 
The second class, is an algorithm that has two inputs, the point x, and the desired 
approximation e. Unlike the previous type, here the algorithm uses the value of e during 
the computation. The upper bound belongs to the second class of algorithms. 

Section 6.3 shows that the floor function can be used to accelerate exponentiation. 

2 

Suppose we would like to generate the number 2 2 ' . Using repeated squaring this can be 
done in 0(i 2 ) steps. We show that this number can be generated in 0(i) steps, given 

2 

any integer x > 2 2 ' (and using the floor function). Using this procedure we show that 
part of the lower bounds proved in Chapter 3 are tight. 



6.1 Computing polynomials 

We consider the complexity of evaluating a polynomial, whose coefficients are integers, 
on an n-bit integer input. We show that this problem can be solved in constant time, 
using arbitrary constants and the floor function. 
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starts at v\, and ends at V{ +2 . We choose ja;+i, a !+1 + /,- +1 ] as a subinterval of [a;, a, + U]. 
Therefore, by the induction hypothesis we have that Properties 1-3 are satisfied by the 
interval [a,-+i, a,-+i + /, + i] and the prefix of V that starts at v-i and ends at u,-+i. In order to 
complete the proof of the lemma we need to show that (a) there exists an outgoing edge of 
Vi+i such that for any input x G [a I+1 , a t+1 -f /; +1 ] the computation follows this edge, and 
(b) Properties 2-3 are satisfied also for the vertex u,- +1 and the interval [a,+i,a; +1 + U+i]. 

By the definition of the tree T, the vertex v, + i is either a comparison vertex or a 

computation vertex. If it is a comparison vertex, then a comparison g(x) < h(x) is 

performed. If it is a computation vertex, then either f Vi+1 — r,-+i(x) = g(x) o h(x) 

for o 6 { + ,_,*,/}, or f Vi+l = [g(x)\ is evaluated. Here, g(x),h(x) <= {0,1} U {f Vj \vj 

is a computation vertex, j < i}. 

By the induction hypothesis, g(x) and h(x) are rational functions of degree less than 
Di and max-coef(g), max-coef[h) < Mi. 

The proof is based on a case by case analysis. In each case, we define the next vertex 
v i+2 on the path V, the interval [a,+i,a,+i + / l+1 ], and the rational function r,- + i(x) 
(whenever i>; +1 a computation vertex). 

First, we resolve the case when t>, +1 is a comparison vertex. The comparison is of the 
form g(x) < h(x). By the induction hypothesis, both h(x) and g(x) can be represented 
as polynomials of degree less than D t . Consider the rational function g(x) — h(x), which 
is of degree less than 2D,-. Use Lemma 5.2 and choose a subinterval [a i+1 ,a t+1 + y^-] 
where g(x) — h(x) does not have any poles or zeros. Set /,- +1 = j^. It is now easy to 
check that properties 1-3 are satisfied in this case. 

Next, consider the case when t> 1+1 is a computation vertex. The following possibilities 
may arise. 

Suppose that o <E { + ,-,*,/} . Let r i+1 (x) = g(x) o h(x), a i+1 = a,-, and /,- +1 = /,-. 
By Lemma 2.1 the degree of r t+1 (x) is less than 2A, and its maximum coefficient is 
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max-coef{r) < M , there exists a subinterval [a, a + , ^. . 4 ] such that for any v in the 
subinterval, \v^ s - r{v)\ > (2{2M) 2 )- 1 (^- 3 Y sd+1 ^ . 

Proof: Let x" = v, such that y/a < x < \/a + j- < \/a + / Consider the rational function 
E(x) = x— r(x s ). Clearly, the degree of E(x) is bounded by sd+1 and max-coef[E) < 2M. 

By Corollary 5.8, there exists a subinterval [/?,/? + A] C [\/a, \/a + {;], such that 
A = iM .w - and for any x G \P>P + X ]> 1^)1 ^ (2(2Af) 2 )- 1 (^)( sd+1 ). 

Let a = /? s and the subinterval be [a, a + sX/2]. For any v € [a, ct + sX/2], then 
x 6 [/?,/?+ A]. This implies that on the subinterval [a,a + sX/2], then difference between 
v 1 ' 3 and r(v) is at least (2(2M) 2 )- 1 (^ sd+1 K □ 



5.5 Lower bound for approximation 

In this section we use the technique that we developed in the previous section to establish 



the lower bound on approximating ^/x. We show an Q(wloglog ~ t ) lower bound on the 
depth of any computation tree T, with OP — {+, -, *, /, [-J } and constants {0, 1}, that 
e-approximates -f/x, where 5 is any fixed constant, for x € [1,2], in norm L\. 

Theorem 5.11 For any integer s > 2, if a computation tree T , with operations 
OP = { + , — , *,/, (_-J) and constants {0, 1}, approximates y/x, for a fixed s and x € [1,2], 
such that 

errori(T(x), \/x) < e 

then T has depth h = ^(■i/loglog -). 

Proof: The bulk of the proof involves constructing (i) a path V from the root of T to 
one of its leaves, and (ii) a subinterval [a, a + I] of [1, 2], with the following properties: 

1. On input a e [a,a + I] to T, the computation follows the path V; and 
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Note that q{z) and p(x) have the same degree and, 

\b\(!-) d max{|g(*)|}= max {|p(x)|}. 

Z z6[— 1,1] x£[a,a+l] 

By Lemma 5.5, there exists z € [-1,1] such that \q(z )\ > 2~ d+1 . Therefore, K > 
2~ d+1 \b\ (t) d . (Recall that |6| > 1.) Hence K/2 > ({) d , which completes the proof. □ 

The following lemma combines the above lemma, which enables us to give lower 
bounds for the value of a polynomial, with the first lemma, that observed how to upper 
bound a polynomial, to establish a technique to upper bound a rational expression. 

Lemma 5.7 Given an interval [a, a + I], where 1 < a < 2 and < / < 2 — a, and 
a rational function r(x) of degree d with integer coefficients and max-coef(r) < M; there 
exists a subinterval [a,a+ j^], such that (i) r{x) is monotone in this subinterval, and 
(ii) |r(x)| < 2M( 8 j) d for all x 6 [a, a + ^}. 

Proof: Let r(x) = 2J|i where p(x) and q(x) are polynomials of degree at most d with 
integer coefficients. By Lemma 5.1, the maximum value of \p(x)\ for x € [1, 2] is bounded 
from above by M2 d+1 . By Lemma 5.2, there is a subinterval [/?,/?+ ^] in which r(x) is 
monotone. By Lemma 5.6, there is a subinterval of [/?,/?+ ^], denoted by [a,a+ jg-p], 
such that \q(x)\ > ({) d for all x G [a,a + ^]. Therefore, |r(x)| < 2M(f) d for all 
x £[<*><*+ sk*]- □ 

Let ri(x) be a rational expression and r 2 (x) = ^y. Since both the degree and 
the maximum coefficient of n and r 2 are identical, a lower bound for the value of r x 
would imply an upper bound for the value of r 2 . The following corollary formalizes this 
observation. 

Corollary 5.8 Given an interval [a, a + l], where 1 < a < 2 and < / < 2 - a, and 
a rational function r(x) of degree d with integer coefficients and max-coef{r) < M; there 
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We need to use a slight modification of Markoff inequality. The segments that we are 
interested in are not necessarily [—1,4-1]. Therefore the following formulation would be 
more convenient. 

Lemma 5.4 Let p(x) be a polynomial of degree d, and p'(x) the derivative of p(x). 

Then 

2d 2 
max {\p'(x)\} < — max {\p{x)\}. 

xe[a,a+l] I x£[a,a + l] 

Proof: Consider the substitution x = z| 4- 2 ^ ti . If z E [-1, 1] then x e [a,a + I]. Let 
q(z) = p{z{ + 2itJ). By Lemma 5.3 

^ax^l^z)!} < d 2 ma.x^{\q(z)\}. 

Since gg = g, and g = f , the lemma follows. □ 

The importance of this lemma is that using it we can claim, that if over the interval 
[a, a + /] the maximum value of a polynomial is p(x'), then there is a neighborhood of 
x', in which the value of the polynomial is Q(p(x')) and the size of the neighborhood is 
fl(l/d 2 ). 

The family of polynomials, known as Chebyshev polynomials, have many application 
in Numerical Analysis. We use here only one aspect of the Chebychev polynomials, and 
that is their ability to give a lower bound for the value of a polynomial. 

Lemma 5.5 (Chebyshev [Riv69, page 31]) Let p(x) be a polynomial of degree d>\ 
defined over [— 1, 1]. If the leading coefficient of p(x) is one, then there exists x Q G [— 1, 1] 
such that |p(xo)| > ^r- 

The importance of the above lemma is to enable us to establish a lower bound on 
the value of a polynomial. Once we are able to show a lower bound and upper bounds 
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proving a lower bound for rational expression, over [a, a + I]. 

The last part of the proof involves proving lower bounds for rational functions. We 
show that for a rational expression r(x), defined over [a,a + I], there is a subinterval 
[a, a + A], such that for any input x G [a, a + A], the difference between r(x) and -f/x is 
at least 8. This implies that 

\\r(x)-^\\ 1 >SX. 



5.4 Lower bound technique 

We would like to establish a technique to bound the value of a rational function over an 
interval. In general, a rational function may be unbounded; therefore we are seeking a 
"large" subinterval in which the rational function is bounded. The techniques that we 
develop show how to both upper bound and lower bound the value of a polynomial. The 
first lemma shows a very trivial upper bound on the value of a polynomial defined over 
[1,2]. Much of the remainder of the section is devoted to showing that given a polynomial, 
there is a subinterval in which we can lower bound the value of the polynomial. 

Lemma 5.1 Let p(x) be a polynomial of degree d and max-coef(p) < M. Then, for 
allxe [1,2], |p(x)| <M2 d+1 . 

Proof: Let p(x) = £? =0 a,-ar\ Then clearly, 

d d d 

\p( x )\ = I E a «' x 'l ^ £ kV < E M2 ' < M2d+1 

»'=0 t=0 t=0 

□ 

It would be conve. it that the rational functions that we are dealing with would 
not have any poles (i.e. inputs for which they go to infinity) or any roots. The following 
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Recently, Bashouty [Bas90b] has shown that approximating the square root of two, 
requires Q(loglog^) operations, when the only input is x, and e is not an input. This 
implies a lower bound in norm Loo. It is still an open problem what is the bound for 
norm L\, where the only input is the number to be approximated, and not e. 



5.3 Overview 

In this section we give a general overview of the proof that we show in the next two 
sections. At a very high level, we establish the proof along the following lines. First, we 
show that there is a "large" subinterval, such that all the inputs from this subinterval 
follow the same path. Second, the output that the computation tree computes for this 
subinterval can be expressed as a rational function. Third, any rational function, of a 
given degree and coefficient size, would have to be "far" from the s ih root function, on 
a "substantial" part of the subinterval. Clearly we would have a tradeoff between how 
"far" the functions are, and how "substantial" is the part of the subinterval on which 
they are "far". Combining those two parameters translates to a lower bound on the 
approximation in norm L\. 

Now we elaborate slightly more on each component of the proof. We start with 
the main tool that enables us to show how to handle the floor function. Consider the 
expression [r(:r)J, where r is a rational expression, defined over [a, a + /]. We show that 
there is a sub-interval, [a', a' + I'], such that the value of [r(x)\ is constant in it and the 
ratio between / and /' is bounded as a function of deg(r) and max-coef{r). The smaller the 
ratio between /' and /, the better lower bound we can prove at the end. The proof of this 
claim quite involved, and uses theorems from Approximation Theory, such as Chebyshev 
polynomials and the Markoff inequality. 

Given a computation tree T that receives an input x € [1,2], we show that there 
exists an interval, [a, a + /] C [1,2], such that any input x G [a, a + I] follows the same 
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example, that computing the average distance of n points in the plane requires exactly 
( "J operations. (For average distance, [SY76] showed that it can be approximated with 
substantially less than (") operations.) 

This work pursues the second direction. Rather than computing a function exactly, 
we are interested in approximating the function's value. The set of basic operations 
does not change and remains the set of rational operations and the floor function, i.e. 

{+,-*, A L-J}- 

This chapter is organized as follows. In the following section we define formally the 
meaning of an approximation. In section 5.3 we give a general overview of the lower 
bound technique that we develop. In section 5.4 we develop the lower bound technique, 
that we later use in Section 5.5 to prove the lower bound for approximating the s tk root 
of a real number. 



5.2 Definitions 

The main aim of this section is to define formally the notion of approximation. We start 
by a very brief background, and define inner product of real functions, and a norm of a 
real function. The inner product of two real functions / and g, over a region A, is defined 
as follows, 

< f,9 >= / f(x)g(x)da 



IX 



Given the definition of the inner product, We define the norm Lk, of a function /, 
over a region A, to be 



Wfh = V J A \f(x)\ k dx 
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tree are input vertices of the inputs a and b. If (u,v) is the generator of (a, 6), then, by 
Lemma 4.6, a and b are polynomials in u and v of maximum coefficient and degree less 
than 2 n and n 1 ' 5 , respectively. This implies that the number of monomials in each 
of these polynomials is at most n 2 / 5 and that the value of each monomial is at most 
2" 1/5 (2 2n2/i ) 2 " 1/5 . Therefore, a, b < n*/^ 1 '^ 2 "'" 2 " 1 '* = rc 2 / 5 2 4 " 4/5 < 2 n , for large enough 
n. 

Lemma 4.7 asserts that some pairs in the set (u,v) € S(0,cti,a2,oi3)n {(u,v) : 1 < 
it, u < 2 2 '( /+1 )} are relatively prime, and some are not. The Correspondence Property, 
Lemma 4.1, guarantees that the gcd of the inputs is the same as the gcd of the generators. 
Since the leaf u is labeled by a constant, the inputs that reach this leaf are either all 
relatively prime, or all not relatively prime. Since all the pairs reach the same leaf, we 
reach a contradiction. □ 
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(u, v) is the generator of (x, y). By Lemma 4.4, the degree of §4^4 o q^ v X is less than 
2D, < 2 2 , and its maximum coefficient is bounded by 2Mf(D{ + l) 2 < 2 2 



2«(» + l) 



Suppose v,- +1 is [j/J, where z/ is a previous computation vertex. By the induction 
hypothesis, val(v) = G "Ji^ v l , where (it, u) is the generator of (x, y). By Lemma 4.5, there is 
a rational expression Q(x, y), and a subset of the inputs <5>( t+1 ', such that val(v) — Q(u, i>), 
where (u,v) is the generator of (x,y). The degree of Q is at most Di+\ < ADf < 2 2 * ( ' +l) , 
and the maximum coefficient of Q is at most M, +1 < 2(4D, Mi) 2D3 < 2 22 

We are not done yet with this case. The substitution of u, by A,- + i(u,-+i) 5 ' +1 + u, +2 , 
affects not only the rational expression Q, which is stated in Lemma 4.5. It also changes 
all the polynomials in E*. Since 6,- +1 < D, . + 1, the degree of the rational expressions in E,- 
is at most D i+X < A(A + 1), and the maximum coefficient is at most M i+i < A^! 1 2 Di Mi. 
Note that the degree and maximum coefficients of previous vertices change, but they are 
bounded by the current D; and M t . □ 



4.4 GCD Lower bound 

In this section we show how to apply the proof technique that we developed in the 
previous section to derive a lower bound for computing the greatest common divisor of 
two integers. 

We start by showing that in any set of inputs S(-), there is one input that is relatively 
prime, and another one that is not relatively prime. 

Lemma 4.7 Let ai,a 2 ,ct3, and t be positive integers such that a x < t, and a 2 ,a 3 < 
2*. Then the set S(0,a u a 2 ,a 3 )n {(u,v) : < u,v < 2 2i(m) } contains two pairs (a ,ai) 
and (6 , b x ), such that gcd(a , a\) ^ 1 and gcd(6 , W) = 1. 

Proof: Let e be the least positive integer exponent such that (1 + a 3 ) e > a 2 . Define 
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Lemma 4.6 Let T be a computation tree of depth h, with two inputs, the operations 
{ + ,—,*,/, ['J } and constants {0,1}. Then, there is a path V from the root of T to a 
leaf, and a subset S of inputs, with the the following properties: 

1. S = S(r, a>i,a 2 ,a 3 , A, A)n {(a, b) : < a,b < 2 n }, for some positive inte- 
gers r, a 1 ,a 2 ,a 3 ,S 1 ,6 2 ,. ..,8 T , A X ,A 2 ,. . . , A r , where A = (8 1 ,8 2 ,. ..,S r ), and A = 
(Ai, A 2 , . . . , A r ); 

2. For each input (a,b) £ S. the computation follows the path V; 

3. For each computation or input vertex v on the path V , there is a pair of bivaria 
polynomials (F l/ (x,y),G L ,(x,y)) with integer coefficients, such that for each in r 
(a,b) e S, G„(u,v) ^ 0, andval(f) = §fy$, where (u,v) is the < r, a u a 2 , a 3 , A, A >- 
generator of(a,b); i.e., the value computed at v on input (a, b) £ S, is the value of 
the rational expression ^y' y \ at (u.v); and 

r G v (x,y) V i n 

4. Let E = {F u (x,y),G„(x,y) \ v G V}. Define D and M to be the degree and the 
maximum coefficient of E, respectively. Then, r < h, max{ai,jD} < 2 2 * h , and 
max{Q 2 ,a 3 ,M} < 2 22 . 

At first sight it might be surprising that the above lemma does not claim explicitly 
any bound on the values in A and A. The reason is that we can add the input as two 
input vertices in the tree T, and therefore, we express the input as a rational expression 
of the "generators". This guarantees the connection of the values in E with A and A. 

Proof: We denote the vertices on the path V by Vi,v 2 ,. . . ,vi, in that order, where v l 
is the root of the tree T, v { is a child of u.-.j, and v t is a leaf of the tree T. The path 
V and the set S are defined inductively, starting with the path v x ,v 2 ,v 3 and the set 
5 (2) = S(Q, 1,0, 1) (which consists of all pairs (a,b), where a > b > 0). Following that 
proof, suppose that (a) we have selected a prefix of V, which starts at v u and ends at 
a vertex v i+1 , and (b) constructed the set S& = S (r^, a{ {) , a { 2 i} , a { ^\ A<*'>, A<*'>) with the 
following properties: 
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and, 

Q , 2 = max{a 2 ,7r 2 (^),7r 2 (I < 'P 2 -^)} < M D+1 + ((£> + 2)M) D+2 + 1 

< 2((D + 2)M) D+2 . 

This guarantees that for any (u,v) € 5(0, o^, a 2 , ^0:3), < R(u,v) and R(u,v) < 
L d P2(u,v). This implies that 

R(u,v ) 
L d P 2 (u,v) 

We conclude that for (x,y) G 5(0, a'j, a 2 ,7ra3), 

Clearly, t/e^(Q) = deg(A) < D, and max-coef{Q) < max-coef{A) + 2M D < 2((2 + 
D)M) D+1 . 



In Case 5 we show how to reduce the general case to Case 4. In order to summarize the 
effect of Case 4, observe that max{a' l5 deg(Q)} < (D+l) 2 , and max{a' 2 , 7ra 3 , max-coef(Q)} 
<2((D + 2)M) D+ \ 

Case 5: The leading monomial of P 2 (x,y) is Lx d *y'. Our goal is to reduce this case 
to Case 4 where no powers of y appear in the leading monomial. We introduce a new 
variable z and substitute x using it. Let 6 — max{a l5 <fe<7(P 2 ) + 1} < D + 1 and A — a 3 . 
we substitute x by Ay* + z and consider the polynomial Q(y, z) = P 2 (Xy s + z, y). Observe 
that the leading monomial in Q{y, z) is a constant times a power of y, i.e., no power of z 
appears in the leading monomial of Q(y,z). Thus, we have reduced this case to Case 4. 
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As in Case 3, let A(l, 1) = cL d + 7, where c is an integer and 7 is a non-negative 
integer such that < 7 < L d . 

Pi(u,v) _ ~fP 2 (u,v) + R(u,v) A(u,v) -7 



P2(u,v) 



L d P 2 {u,v) 



L d 



Let 7r = L d . Then, for each (u,v) € S(0, d l5 d 2 ,7ra 3 ), A(u,v) = 7 (mod L d ). Hence, 
for each such pair (u,u), the expression ^"'"J~7 evaluates to an integer. Therefore, 



P 2 («,t;) 



lP 2 {u,v) + R(u,v) 
L d P 2 (u,v) 



+ 



A(u,v)--f 
L d 



Our aim is to restrict the inputs such that the value of [ ^fa'Qffi'^ J is fixed. We 
distinguish between two subcases, according to the value of 7. 



Subcase J^.l: 7 > 0. In this subcase we restrict the input, such that the value 
°f \ * l"'AkJ) J * s zero ' Consider the polynomial V(x,y) = L~ d (-y P 2 (x,y) 
+R(x,y)). Since deg x (R) < deg x (P 2 ) the leading coefficient of V is jfrL. Let 
B(x,y) = P 2 (x,y) - V(x,i/). The leading coefficient of B(x,y) is (1 - ^)L. Since 
< fa- < 1, the leading coefficient of B(x,y) is positive, i.e. sign(B) — +1. Recall 
that 7T = X rf . Let, 

a' 1 =max{di,7r 1 (5),7r 1 (7P 2 + i2)} < D(D + 1) + 1 

< (^> + l) 2 , 



and, 



a' 2 = max{d 2 ,7r 2 (£), 7^(7^2 + #)} < M D+1 + ((D + 2)M) D+2 + 1 

< 2((L> + 2)M) D+2 . 
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3. P 2 (x,y) is a constant, i.e., d x — d y — 0. 

4. / = 0, i.e. y does not appear in the leading monomial of P 2 (x,y). 

5. / > 0, i.e. the general case. 

Case 1: Pi(x,y) = P 2 (x,y), for (x,y) <E 5(0, a u a 2 ,a 3 ) Let Q{x,y) = 1/1, tt = 1, 
a\ = a.\ and a' 2 — a 2 . 

Case 2: Pi(x,y) -< P 2 (x,y). Let B(x,y) = P 2 (x,y) — Pi(x,y). Since the leading coeffi- 
cient of P 2 is positive and greater than the leading coefficient of Pi, then sign(B) = +1. 
Using Lemma 2.3 let a[ = max{ai, Ki{B)} and a' 2 = max{a 2 ,7r 2 (.B)}. Lemma 2.3 guar- 
antees that for each (u, v) <E 5(0, a\,a 2 , 1), B{u, v) > 0. Observe that B(u, v) > implies 
that P 2 (u,v) > Pi{u,v). Since a\ > 7r a (P 2 ) and o/ 2 > 7r 2 (P 2 ), then P 2 {u,v) > 0. Thus, 

P2(u,v) < • 
For each (u,v) 6 5(0,a' x , a' 2 , 1) both Pi{u,v) > and P 2 (u,v) > 0. Therefore, 

„ Pi(u,v) 

°<^f J -4< 1 - 

P 2 {u,v) 
Let 7r = 1. This implies that for (x,y) 6 5(0, a'j, a 2 , 1), 



Pi(n,v) 
P2(u,v), 



= j = Q( x ,y) 



Clearly, deg(Q) = and max-coej[Q) = 1. Since deg(B) < D, and max-coef(B) < 2M, 
then by Lemma 2.3, a[ < D + 1 and a' 2 < 2M + 1. 

Case 5: P 2 (x,y) is the integer Z, > 1. Recall that all coefficients of P x {x,y) are integers, 
and P x {x,y) > 0, for (x,y) e 5(0, d x , d 2 , !)• Let Pi(l,l) = cZ + 7, where c is an integer 
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Otherwise, let B(x,y) = P(x,y) - Q{x,y) = fijfji. Since P ^ Q then sign{B) ^ 0, 
which implies that sign(Bi) ^ and sign(B 2 ) ^ 0. By Lemma 2.2, deg(B) < 2D, and 
max-coej{B) < 2(D + 1) 2 M 2 . Let a[ — max{ o x , k 1 (B 1 ), tti(B 2 )}, and a 2 = max{ 
a 2 , t 2 (Bi), 7r 2 (i?2)}- Lemma 2.3 gives the bounds a[ < max{ai,2.D + 1} and a 2 < 
raax{ai,2(D + 1) 2 M 2 + 1}. Since the comparison P(x,y) > Q(x,y) is equivalent to 
B(x,y) > 0, the lemma follows. □ 

The next step is showing the effect of rational operations on our parameters. This 
case is a rather simple case. 

Lemma 4.4 Let P(x,y) - §4^4 and Q(x,y) = 3}\ x,y \ be two multivariate rational 
expressions, where deg(P), deg(Q) < D, and max-coef[P) , max-coef(Q) < M. Then, 
deg(R oS)< 2D, and max-coef(R o S) < 2(1 + D) 2 M 2 , where o e {+, -, *,/,}. 

Proof: A special case of Lemma 2.2. D 

In the following lemma we show how to "handle" the floor operation, which is the most 
interesting case. As for one variable, the main objective is to find a rational expression 
that coincides with the value of the floor operation on some subset of the inputs. In 
order to get control over the degree and maximum coefficient of the rational expression 
we restrict the input to a subset of the previous inputs. The proof is similar in spirit to 
Lemma 3.4, and is also done cases. The cases represent different relationships between 
the lexicographic order of the two polynomials in the rational expression (to which the 
floor is applied). The main conceptual difference with the one variable proof technique 
is in Case 5 of the proof, where we introduce new variables. 

Lemma 4.5 Let P(x,y) = pj x x ' v l be a rational expression with integer coefficients, 
defined over 5(0, a u a 2 ,a 3 ), such that P(x,y) > for (x,y) € 5(0, a x , o 2 ,a 3 ). Define 
max{4, ai, deg(P)} = D, and max{4, a 2 , a 3 , max-coef(P)} = M. Either 



42 CHAPTER 4. LOWE BOUND FOR GCD 

The intuition is that the sequence uq, . . . , u r+i includes all the variables that we 
introduce. Notice that equations 4.1 and 4.2 imply that u > Ui > u 2 > u 3 > . . . > u r > 
u r +i- The following definition relates the first pair (u ,Ui), and the last pair (u r ,u r+ i). 

Definition 7 Let S(r, ot\, 0:2,0:3, A, A) denote the following set of ordered pairs of 
positive integers: 

{(uo,iti) : there exist integers u r ,u T+ x such that (uo,ui) 

is <r,a\, a 2 , 0:3, A, A > —generated by (u r , u r+i )} . 

For convenience we omit the null vectors A and A whenever r = 0. In this case, the set 
S(0, oti, 0:2,0:3) consists of all pairs (u ,ui) such that u > (ui) ai , u\ > a 2 , u = ui = I 
(mod a 3 ) (in accordance with the above definition). 

Perhaps the most important characteristic of the above definitions is its similarity to 
the Euclidean algorithm for solving the gcd problem. As an immediate consequence of 
the definition, we get the two properties stated below. These properties are the key to 
our proof strategy. 

Lemma 4.1 (The Correspondence Property): There is an one-to-one corre- 
spondence between the elements of the sets S(r, Qi,q 2 , 0:3, A, A) and £(0, 01,0:2, a 3 ). 
Specifically, each pair («o,«i) € 5(r, ai, q 2 , 03, A, A) corresponds to the unique pair 
(u r ,u r+ i) € 5(0,01,02,0:3) such that (u r ,u r +i) is the < r, ai,o 2 , 03, A, A > -generator of 
(u ,ui). Furthermore, if(u Q ,Ui) corresponds to (u r ,u r+1 ) then gcd(u , "i) = gcd(u r ,u r+1 ). 

Proof: Let a 2 > a$ > . . . > a r+ i and b 2 > 63 > . . . > b r+i , be the generating sequences 
for (a ,ai) and (6 ,&i), respectively. It is easy to check that if (a,-,a t+ i) ^ (6,-,6, + i) for 
some < i < r, then (a,-, a j+1 ) ^ (bj, b J+1 ) for any j, < j < r. The assertion about the 
gcd's follows from the Euclidean algorithm. □ 
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We give here a very brief description of the differences between the technique for 
two and one variables. The main difference between the two proof techniques is in 
the handling of the floor operation. Consider the expression mm. If we assume that 
x > y > 0, then we can claim that the value of this floor operation is zero. On the other 
hand, consider - . Since we assume that x > y > 0, we have to find a new solution for 
such an expression. We overcome the problem by adding a new variable, z, and requiring 
that both y > z > 0, and y + z = x, which implies that - = 1. In some sense we 
transformed a computation based on the inputs x and y to a computation based on the 
input y and the "new input" z. The problem is that we introduce a new dependency 
between x and y, e.g. in our example x = y + z < 2y. The general solution has a similar 
flavor: we substitute x by \y s + z. The values of 6 and A are chosen to guarantee that 
previous requirements on the relationship between x and y hold. 

The introduction of a new variable is the main difference between the technique for 
two variables and for one variable. Note that we may have to introduce many new 
variables, one for each floor operation. This causes additional complications in the proof 
technique that we need to resolve. 

Using this proof technique we show a lower bound on the depth of any computation 
tree with OP = { + ,-, *, /, [-J }, that computes the gcd of all pairs of n-bit integers. The 
important part of the lower bound is that it is non-constant; quantitatively, the lower 
bound is fi(loglogn). 

This chapter is organized as follows. In section 4.2 we describe the way that we 
structure the inputs that will be used in the proof. In section 4.3 we develop the proof 
technique. In section 4.4 we use the proof technique to show a lower bound for computing 
the greatest common divisor of two integers. 
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3.3 Applications 

We start this section by characterizing a class of problems for which our technique gives 
non-trivial lower bounds. In the previous section we showed, that given a computation 
tree, we can find a path and a set of inputs S(n, A), such that all those inputs follow the 
path. The following defines an M (n)- invariant function to be one which has two inputs 
in S(n, A), for which the output is different. 

Definition 5 Let f(x) be a boolean function whose domain is the set ofn-bit integers, 
and let M(n) > 0. Then f is M(n) -invariant if, for any integer A < M(n), there are 
two n-bit integers a a and a 2 , satisfying (i) a x = a 2 = mod A, and (ii) f(a-i) ^ f(a 2 ). 

The following theorem states that the lower bound technique that we developed in 
the previous section gives a lower bound for any M(rc)-invariant function. 

Theorem 3.6 Any decision tree with OP = { + ,-,*,/, |_-J} and constants {0,1}, 
that computes an M(n) -invariant function, f(x), for all n-bit integers, must have depth 
J^loglogMCn)). 

Proof: Since / is a boolean function, we can assume that the leaves are labeled by 
either zero or one. (This adds at most two to the depth of the tree.) We prove the 
lower bound by contradiction. Suppose that we are given a decision tree T of depth 
h < \ yjlog log M(n). that decides /. In Theorem 3.5 we prove that there is a A < 
2 2 < M(n), such that all the inputs in 5(n,A) reach the same leaf. Since / is M(n)- 
invariant, there are a a and a 2 , such that a x ,a 2 e S(n,X) and /(ai) ^ /(a 2 ). 

Both a x and a 2 follow the same path in T. Therefore T either accepts or rejects both 
f(a x ) and f(a 2 ). This contradicts the fact that T computes /. D 

Clearly, the larger the value of M(n), the better the lower bound that we can show. On 
the other hand, since M{n) < 2", we can not show lower bounds greater than fl(,/Iogn). 
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This enables us to transform all the floor operations, on a specific path, to evaluations 
of rational expressions. Lemma 3.2 gives a way to restrict the input such that the 
comparison value is fixed. This will enable us to find an integer A and a path "P, such 
that any integer in S(n,\) follows the path V. Lemmas 3.2, 3.3, 3.4 also bound how 
fast the degree and the size of the coefficients of the rational expressions can grow. The 
bounds on the degree and maximum coefficients enable us to bound the value of A, which 
is essential for showing the lower bound on complexity. For example, note that if A > 2 n , 
then S(n,\) — 0, which would not enable us to derive any interesting lower bounds. 

Theorem 3.5 Let T be an decision tree of depth h with OP = {+, — , *, /, [_ - J }• There 
is an integer A < 2 2 and a leaf vi of T , such that for each input a 6 S(n,\), the 
computation follows the path from the root ofT to its leafvi. 

Proof: Denote the vertices on the path V from the root of T to its leaf vi by t^, u 2 > ■ • • , v h 
in that order, where v x is the root of the tree T and v, is a child of i>,_i. We define the 
path V and the integer A inductively, starting with the empty path and A (0) = 1. (S(n, 1) 
consists of all n-bit integers.) As part of the induction hypothesis, we maintain three 
properties of the path and the set under consideration. These properties are described 
below. 

Suppose that (i) we have selected a prefix of V, which starts at v u and ends at a 
vertex v i+i , and (ii) defined a parameter A^) with the following properties: 

1. For each input a 6 5(n,A (,) ) the computation follows the path from the root to 
t>i+i; 

2. For each computation vertex v on the path from the root to the vertex u,-+i, ex- 
cluding the vertex u,- +1 , there is a pair of polynomials (F v (x),G v (x)) with integer 
coefficients, such that for each input a <E S(n, \ {i) ), G„(a) ^ 0, and val{v) = §^±. 
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Case 3: P 2 {x) -< P\(x). Let L be the leading coefficient of P2{x). Note that since 
sign(P 2 ) = +1, then L > 0. Corollary 2.7 implies that 



A(-r) 



A(a)P 2 (s) + R(x) 



where A(x) and R(x) are polynomials with integer coefficients such that, max-coeJ{A) < 
2 D M D+ \ max-coef[R) < 2 D+1 M D+2 , deg(A) < D and deg(R) < D. 

Consider the constant term of A(x). We denote this constant by cL d + j, where c is an 
integer and 7 is a non-negative integer such that < 7 < L d . Let A(x) = A{x) + cL d + j; 
that is, A(x) is equal to the polynomial A(x) minus its constant term. The parameter 
7r will be chosen to be a multiple of L d . This implies that for each a G S(n, \ir), each 
monomial of L~ d A(a) evaluates to an integer. Hence, for each integer a € S(n, \ir), 



A (a) 
P 2 (a\ 



—A(a) + c + 



1 P 2 {a) + R(a) 
L d P 2 (a) 



We distinguish between two subcases: 



Subcase 1: 7 > 0. Consider the polynomial V(x) = L- d (fP 2 (x) + R(x)). The 
leading coefficient of V is L~ d (~fL), therefore sign(V) = +1. Let B{x) = P 2 (x) - 
V(x). The leading coefficient of B(x) is (1 - L~ d -/)L. Since < L~ d -f < 1, the 
leading coefficient of B(x) is positive, i.e. sign(B) = +1. Using Corollary 2.4, let 7r 
be the minimum multiple of L d such that tt > max{7r (B),tt (7 P 2 + R), -k (Pi), tt(P 2 )}. 
Consider a e S(n,Xv). Since n > x(B), then V(a) < P 2 {a). Since x > tt(~jP 2 + R), 
then < V(a), which implies that < V(a) < P 2 (a). Since tt > w(P 2 ), then P 2 (a) 
> 0. This implies that < -^ = ^% a ^ a ) < 1, for any a > tt. We conclude 
that for each a € 5(n, A71-), 



L^2(a)J 



L~ d A(a) + c. Let 



g( x ) = j4(x) J ° L = [P( x )\ for x € S(n, Att). 
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The following claim shows how we can restrict the input by changing the parameter of 
S(-). Namely, multiplying the parameter by another integer and considering only integers 
that are divisible by the product. 

Claim: 3.1 For any integer n > 1, S(n,\7r) C S(n,X). 

The following lemma enables us, each time the computation reaches a comparison 
vertex, to fix one of the directions. This involves restricting the inputs, from 5(n, A) to 
a subset of it, S(n, \ir). An important part of the lemma is to bound the value of 7r. 

Lemma 3.2 Let P{x) and Q(x) be two rational expression, of degree at most D and 
maximum coefficient at most M, defined over S(n, A). There exists ir < 2(1 + D)M 2 + 1, 
such that for all x G S(n, Xn), the comparison P(x) > Q(x) is determined (i.e. either 
P{x) = Q{x), P{x) > Q(x), or P(x) < Q(x), for x e S{n, Xn)). 

Proof: Let B{x) = P(x) - Q(x) = gg}. If P{x) = Q{x) for x € S(n, A), let tt = 1, and 
the lemma follows. Otherwise both sign^) ^ and sign(B 2 ) / 0. By Lemma 2.2 the 
degree of B is at most 2D and max-coef(B) < 2(D + l)M 2 . 

Corollary 2.4 guarantees the existence of a positive integer n = ma.x{Tr(B l ),7v(B 2 )}, 
such that for all integers x > r, either B(x) > 0, or B{x) < 0. Furthermore, it guarantees 
that tt < 2(D + 1)M 2 + 1. Since the expression P(x) > Q{x) is equivalent to B(x) > 0, 
the lemma follows. D 

Next, we consider the case of a computation vertex. First we discuss rational opera- 
tions, and restate Lemma 2.2, for univariate rational expressions. 

Lemma 3.3 Let P and Q be two rational expressions, of degree at most D and max- 
imum coefficient at most M, defined over S(n,X). Let R(x) = P(x) o Q(x), where 
oe { + ,-,*,/}. Then, deg(R)<2D and max-coef[R) < 2(D + 1)M 2 . 
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Essentially, the degree argument uses the fact that a polynomial cannot change "too 
fast". Namely, a polynomial of degree d can change its sign at most d+l times. Therefore 
a polynomial of degree d can "disconnect" the inputs to at most d+l different connected 
components. The number of connected components of the function to be computed 
implies a lower bound on the degree of the polynomial, which implies a lower bound on 
the depth. 

The degree argument can be extended to rational functions, and also in higher 
dimensions. The arguments for higher dimensions are much more complex than for 
one dimension, and they are the main part of the lower bound technique developed in 
[B083, SY82, Yao89]. 

Adding the floor function as a basic operation changes the situation dramatically. 
First, using the floor function, one can trivially compute the parity function in 0(1) 
steps. Second, the floor function may increase the number of connected components in 
an unbounded way. For this reason the number of connected components is not relevant 
in computations with the floor function. 

In the remainder of this section we give a general overview of the proof technique 
that we develop. Given a decision tree, the proof constructs a path from the root to a 
leaf, such that a certain set of inputs follows that path. The property of the set, is that 
it includes all the integers that are divisible by a certain integer A. A main part of the 
proof technique is to bound A as a function of the depth of the decision tree, i.e. to show 
that A < g(h), where h is the depth of the tree. In Section 3.3 we show applications 
of this proof technique. We define a boolean function to be M(n )-invariant if for any 
integer A < M(n), there are at least two n bit integers, both divisible by A, for which 
the value of the function differs. We argue that any M(n)-invariant boolean function 
requires depth at least h, where g(h) = M(n). 

As one can expect, the main part of the proof technique is devoted to handling the 
floor function. The main idea is to transform the floor function to a rational function. 
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For univariate polynomial the situation is even simpler, and is summarized in the 
following corollary of Lemma 2.5. 

Corollary 2.7 Let P(x) and Q(x) be two polynomials with integer coefficients, of 
degree at most D, and maximum coefficient at most M. Let L be the leading coefficient 
of Q{x), then there exists A(x) and R(x) such that, 

P( x ) = I L T A(x)Q(x)+ I ^R(x), 

where A(x) and R(x) are polynomials with integer coefficients, and max-coef(A) < 2 D M D+l , 
max-coef{R) < 2 D + 1 M D+2 , deg(A) < D, and deg(R) < D. 
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R(x, y) such that 

p ( x >y) = jj^ A(x,y)Q(x,y)+ —R{x,y), 
where A(x,y) and R(x,y) are polynomials with integer coefficients. Furthermore, 

1. max-coeJIR) < (2 + deg y (Q)) s+1 MN s+1 , 

2. deg x (R) < deg x (Q){= d), and deg y (R) < deg y (P) + (S + l)deg y (Q). 

3. deg x (A) < max{0,6} ; and deg y (A) < deg y (P), and 

4. max-coef[A) < (2 + deg y {Q)) 5 M N 6 . 

Proof: The proof is by induction on 8. The hypothesis holds for the basis case 8 = — 1 
with A(x,y) = and R(x,y) = P(x,y). 

For the induction step, assume that the hypothesis holds for all 8 < k, for some 
k > -1. We prove it for k. Let P(x,y) = pi(y)x e + p 2 (y)x e - 1 + . . ., be such that 
k — e — d. Consider the polynomial 

S(x,y) = LP(x,y)-x k Pl ( y )Q(x,y). 

One can verify that, max-coej{S) < (2 + deg y (Q))MN, deg x {S) < deg x {P) - 1, and 
deg y (S) < deg y (P) + deg y (Q). 

Applying the hypothesis to the pair S(x,y) and Q(x,y), yields 
S(x, y) = jjA'(x, y)Q(x, y) + j~ 5 R\x, y). 
Substituting for S(x,y), we get 

p i*, V) = JJ^ ( A '(x, V) + L 6 x k Pl (y)) Q(x, y) + j^R'(x, y). 
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Bivariate and univariate polynomials 

A bivariate polynomial is a polynomial with two variables. Below, we relate the lexico- 
graphic order defined on the bivariate polynomials, and the order among their values at 
certain points. We concentrate on bivariate and univariate polynomials since those are 
the cases that we would be interested in later. 

Consider a bivariate polynomial P(x,y). We would like to give a simple sufficient 
condition on the inputs of P, such that for any input (x,y) that satisfies the condition, 
the sign of P(x, y) would be the same as sign(P). The sufficient condition that we give in 
the lemma below guarantees that the value of the leading monomial is larger than all other 
monomials combined. This is clearly sufficient to ensure that sign(P(a,b)) = sign(P), 
for such inputs (a, b). 

Lemma 2.3 For each bivariate polynomial P(x,y), such that sign(P) ^ 0, there 
exist positive integers X\(P) and ir 2 (P) such that for all (a,b) satisfying a > b ri( - p) and 

b > *2(P), sign(P(a,b)) equals sign(P). Furthermore, tt^P) < deg y (P) + l, and tt 2 (P) < 

max-coef(p)+ \L\ , r . ., , ,. „ . ± , _, 
i^| — — — '-, where L is the leading coefficient of P. 

Proof: Let M = max-coef{P). Let P(x,y) = T^= Q L k x ik y j \ where P(x,y) is written in 
its normal form, and L k is the coefficient of the fc tn monomial (L = L). Note that since 
sign(P) ^ 0, then L ^ 0. Denote t k (x,y) = ar'V*- Let tt x (P) = l+maxo<*< m _i{0, j fc+ i- 
j k }, and tt 2 (P) = M±L m Clearly, ^(P) < deg y (P) + 1. 

From the lexicographic order it follows that tk £^$ = x'y', where either i < and j < 
ttx(P) or i = and j < 0. Thus, if a > 6*»C> then *-*$$ < h and hence > $f$ < &■ 

Suppose that L > 0, i.e. sign(P) = +1. We show that for (a, b), satisfying a > b* 1 ^ 
and b > tt 2 {P), P(a,b) > 0, i.e. sign{P{a,b)) = +1. For (a, 6), such that a > b^ p \ 



m m I r I 

P(a, b)>L t Q (a, b)-J2 \L k W*V> > L t Q (a, b)(l - £ ££). 



fc=i fc=i 
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Proof: We only prove the bound on max-coef(P*Q). (The other bounds are straightfor- 
ward.) Note that a multivariate polynomial of degree d has at most (1 + d) k monomials. 
Therefore, when we multiply two multivariate polynomials, each of the coefficients of the 
product is the sum of at most (1 + m'm{deg(P), deg(Q)}) k terms, each of them being 
the product of a coefficient in P by a coefficient in Q. Since the product of a coeffi- 
cient of P and a coefficient of Q is bounded by max-coef[P) max-coej{Q), the bound on 
max-coef[P * Q) follows. □ 

A rational number is a number that can be expressed as n/m, where n and m are 
integers. Similarly, a rational expression R(x 1 ,. . . , x k ) can be written as n\ Xl '"" Xk \ , where 
P and Q are polynomials. 

For a rational expression R(x u ...,x k ) = Jjf^f^, where P and Q are multivariate 
polynomials, define the degree of R to be the larger of the degrees of P and Q. Similarly, 
define the maximum coefficient of R to be the larger of the maximum coefficients of P 
and Q. 

Lemma 2.2 Let R(x u . . . , x k ) = gfe*{ and S(x u . . . ,x k ) = gfc;;;j be two 
multivariate rational expressions. Then, 

1. deg(R ±S)< deg(R) + deg(S), 

2. max-coef(R ± S) < 2(1 + mm{deg(R), deg{S)}) k max-coef[R)max-coef{S), 

3. deg{R * S) < deg(R) + deg(S), and 

4. max-coef{R * S) < (1 + min{deg(R), deg(S)}) k max-coeJ{R)max-coef(S). 

Proof: The bounds follow from Lemma 2.1 and from the fact that for o e {+,-}, 
RoS= p rt*°W . □ 
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The computation terminates at a leaf u and outputs the value of u. The time com- 
plexity of a given input is the length of the path that the computation traverses using 
that input. The time complexity of a computation tree is the maximum time complexity 
over all inputs, which is equal to the depth of the tree 1 , assuming that each vertex can 
be reached by some computation. 

As one can note, we did not mention how constants appear in the tree. We can 
introduce constants as additional operations, and view a constant as a constant function. 
In many cases it is simpler to state separately the basic operations and the allowed 
constants. However, the formal interpretation of such a statement is that there is one set 
of basic operations, that includes the constants and basic operations. 

A decision tree is a computation tree whose output values are either or 1. For 
decision trees we can assume that each leaf is labeled by a constant, either or 1. This 
can increase the depth of the tree by an additive factor of at most two. 

A straight line program is a computation tree that does not include any comparison 
vertices. 

The operations { + , -, *} are defined in the natural way. The rational division opera- 
tion returns a rational function of the inputs, e.g. 2/3 = 0.666 • • •. (In case a division by 
zero occurs during a computation, the output of the entire computation is undefined.) 
The set of rational operations includes the operations {+,-,*,/}. We define the floor 
operation in the following way. The floor operation receives as an operand a non-negative 
real number and returns the largest integer smaller than or equal to the operand. 

Our assumption that the operand to the floor function is non-negative simplifies our 
proofs and does not change the depth of the computation tree by more than a constant 
factor. Using this floor operation, one can implement either a general floor operation 
(i.e. the operand is an arbitrary real number) or a mod operation using 0(1) depth. 



lr The depth of a tree is the length of a longest path from the root to a leaf. 
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However, this is not the case in general. In most cases we would desire some con- 
ditional branching mechanism. The time complexity of a program that includes such a 
mechanism cannot be estimated by the number of instructions that appear in the pro- 
gram. This is best illustrated in the following example. Consider the following program: 



z := 

WHILE y > 1 DO 

z := z -f x 

y := y- l 
END WHILE 



For y e [1, n], the complexity of the above procedure is 0(n) operations. The follow- 
ing program, that looks very similar requires only O(logn) operations. 



z := 

WHILE y > 1 DO 

z := z + x 

y := y/2 

END WHILE 



In order to avoid such problems, we "unroll" the loop structure. This means that the 
program is represented as a tree, possibly an infinite tree. The time complexity of a given 
input is the length of the path in the tree traversed by the input. We would consider 
only finite trees. 

After the above motivation, we can give a precise definition of the computation tree 
model. 
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Unlike much of the pr s research, our work concentrates on functions with a 

constant number of input variables. This allows us to get a better understanding of the 
complexities that are involved with the floor operation. Since there are only a constant 
number of inputs, we define the complexity measures as a function of the complexity of 
the input, e.g., the number of bits that represent an integer input. We develop a general 
technique for deriving lower bounds for such functions. In combination with the lower 
bounds, we show upper bounds that use the floor function in new and novel ways. In 
some cases we are able to derive matching lower and upper bound, up to a constant 
multiplicative factor. 

We develop a general technique to handle functions of a single rc-bit integer input. 
Using this technique we derive Q(,/log?z) lower bounds for problems such as computing 
[log log x\ , deciding if the input is a perfect square (i.e. whether its square root is also an 
integer), and other problems. For computing [log log a?J, we give a 0(y/\ogn) algorithm, 
and thus show that this lower bound is tight. (Note that using only rational operations, 
computing [log log x\ requires Q(logn) operations.) 

We extend this technique to handle two variable functions, and show that computing 
the greatest common divisor of two integers requires a non-constant number of operations. 
More precisely, we show an fi(loglogn) lower bound for computing the greatest common 
divisor of two n-bit integers. This lower bound holds even if we restrict our attention to 
deciding if a pair of integers are relatively prime or not. 

We also develop a lower bound technique for approximating real valued functions. 
We consider approximating the s ik root of a real number from the interval [1,2], on 
the "average", where s is a fixed constant. We define an average approximation by 
considering the function which gives the absolute difference between the approximated 
value and the correct value, for each x <E [1,2], and integrating this function over the 
interval [1,2]. The value of this integral is the "average error" of the approximation. 
We show that in order to approximate the s ih root of an input from [1,2], such that 
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At the beginning of this decade, a general technique, based on topological arguments, 
was developed in [B083, SYS2]. The technique relates the logarithm of the number 
of connected components of a decision problem and the number of rational operations 
the problem requires. Based on this technique, many problems, whose inputs are either 
real or rational numbers, have been given tight bounds. This technique, under some 
restrictions, was extended to handle integral input by [Yao89]. 

Another model that uses abstract arithmetic operations, rather than explicit bit op- 
erations, is the strongly polynomial model. The motivation for this model is to achieve 
polynomial time algorithms that would not depend on the representation of the input. 
This would allow inputs with infinite representation, as in the case of real numbers. The 
aim is that the number of operations would be polynomial in the number of inputs, and 
not depend on the size of the input. Grotschel, Lovasz, and Schrijver, in their book "Ge- 
ometric Algorithms and Combinatorial Optimization", asked if there exists a strongly 
polynomial algorithm for the greatest common divisor of two integers. (See [GLS88], 
pp. 32-33, p. 225.) Notice that since there are only two integer inputs to the problem, 
any strongly polynomial algorithm for this problem must have a constant number of 
arithmetic operations. 

There are two "standard" sets of operations that are considered in strongly polynomial 
computations. The weaker model considers only rational operations, i.e. {+,—,*,/}. 
The more powerful model has an additional "rounding down" operation (i.e. floor). 
Stockmeyer [Sto76] proves an Q(n) lower bound for deciding if an n-bit integer is odd 
or even, in the weaker model, where only rational operations are allowed. We show in 
this work that there is no strongly polynomial algorithm for the greatest common divisor 
problem in the stronger model, where rounding operations are allowed. Our result implies 
that strongly polynomial time is different from polynomial time. 

When defining strongly polynomial computations one normally restricts the size of 
the integers involved in the computation to have size polynomial in the size of the original 
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common divisor of two integers requires a non-constant number of operations; such 
a lower bound would separate polynomial time and strongly polynomial time. 

We resolve this open problem and show that there is no computation tree that 
computes the greatest common divisor in a fixed number of operations; in fact, 
we prove an Q(loglogn) lower bound for computing the greatest common divisor 
of two n-bit integers. Thus separating polynomial time and strongly polynomial 
time. 

3. Approximation problems. In this category we consider the complexity of computing 
an approximation to the s th root of a real number. The notion of approximation 
that we cons ider is "on the average", which is modeled by an L x norm. We show a 
^(y log log 7) lower bound for approximating the s ih root of inputs in the interval 
[1,2], within e on the average. (The inputs to the computation, in this case, is the 
number for which the s tk root is approximated.) 

We also show interesting upper bounds. Using Newton's method one can approx- 
imate the s th root in O(loglogi) rational operations, and this bound is tight (for 
rational operations). Using the floor f unction, one can accelerate the computation, 
and compute an approximation in Q(^loglogi) operations. (The input to the com- 
putation, in this case, are both the number for which the s ih root is approximated, 
and the error parameter e.) 
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